
warf Sequence.ST25.txt 
SEQUENCE LISTING 



<110> Harms, Jerome S 
Splitter, Gary A 
Eakle, Kurt A 
Bremel , Robert D 

<120> inducible Protein Expression System 

<130> 2240/3 

<140> US 10/763,976 
<141> 2004-01-23 

<160> 65 

<170> Patentln version 3.3 

<210> 1 

<211> 576 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered Sequence from virus and plasmid 



<220> 

<221> Promoter 
<222> (87). .(432) 
<223> BLV Promoter 












<400> 1 
aggaaaccag 


cagcggctat 


ccgcgcatcc 


atgcccccga 


actgcaggag 


tggggaggca 


60 


cgatggccgc 


tttggtcgag 


gcggatccta 


gcagaaaaat 


aagacttgat 


tcccccttaa 


120 


aattacaact 


gctagaaaat 


gaatggctct 


cccgcctttt 


ttgaggggga 


atcatttgta 


180 


tgaaagatca 


tgccgaccta 


ggcgccgcca 


ccgccccgta 


aaccagacag 


agacgtcagc 


240 


tgccagaaaa 


gctggtgacg 


gcagctggtg 


gctagaatcc 


ccgtacctcc 


ccaacttccc 


300 


ctttcccgaa 


aaatccacac 


cctgagctgc 


tgacctcacc 


tgctgataaa 


ttaataaaat 


360 


gccggccctg 


tcgagttagc 


ggcaccagaa 


gcgttcttct 


cctgagaccc 


tcgtgctcag 


420 


ctctcggtcc 


tgcctcgaga 


agcttgttat 


cacaagtttg 


tacaaaaaag 


ctgaacgaga 


480 


aacgtaaaat 


gatataaata 


tcaatatatt 


aaattagatt 


ttgcataaaa 


aacagactac 


540 


ataatactgt 


aaaacacaac 


atatccagtc 


actatg 






576 



<210> 2 

<211> 930 

<212> DNA 

<213> Bovine leukemia virus 



<220> 

<221> CDS 

<222> (1)..(930) 
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Warf Sequence.ST25.txt 



<400> 2 

atg gca agt gtt gtt gqt tgg gqg ccc cac tct eta cat gec tgc ccg 48 

Met Ala Ser val val Gly Trp Gly Pro His Ser Leu His Ala Cys Pro 

15 10 15 

gec ctg gtt ttg tec aat gac gtc acc ate gat gec tgg tgc ccc etc 96 

Ala Leu val Leu Ser Asn Asp Val Thr lie Asp Ala Trp Cys Pro Leu 

20 25 30 

tgc gqg ccc cat gag cga etc caa ttc gaa agg ate gac acc acg cac 144 

cys Gly Pro His Glu Arg Leu Gin Phe Glu Arg lie Asp Thr Thr His 

35 40 45 

acc tgc gag acc cac cgt ate acc tgg acc gec gat gga cga cct ttc 192 

Thr Cys Glu Thr His Arg lie Thr Trp Thr Ala Asp Gly Arg Pro Phe 

50 55 60 

ggc etc aat gga gcg ctg ttc cct cga ctg cat gtc tec aga gac ccg 240 

Gly Leu Asn Gly Ala Leu Phe Pro Arg Leu His val Ser Arg Asp Pro 

65 70 75 80 

gee cca agg gee cga cga etc tgg ate aac tgc ccc ctt ccg gec gtt 288 

Ala Pro Arg Ala Arg Arg Leu Trp lie Asn Cys Pro Leu Pro Ala val 

85 " 90 95 

cgc get cag ccc ggc ccg gtt tea ctt tec ccc ttc gag egg tec ccc 336 

Arg Ala Gin Pro Gly Pro val Ser Leu Ser Pro Phe Glu Arg Ser Pro 

100 105 110 

ttc cag ccc tac caa tgc caa ttg ccc teg gec tct age gac gqt tgc 384 

Phe Gin Pro Tyr Gin Cys Gin Leu Pro Ser Ala Ser Ser Asp Gly Cys 

115 120 125 

ccc gtc ate gqg cac ggc ctt ctt ccc tgg aac aac tta gta acg cat 432 

Pro Val lie Gly His Gly Leu Leu Pro Trp Asn Asn Leu val Thr His 

130 135 140 

cct tgt cct egg aaa gtc ctt ata tta aat caa atg gee aat ttt tec 480 

Pro Cys Pro Arg Lys val Leu lie Leu Asn Gin Met Ala Asn Phe Ser 

145 150 155 160 

tta etc ccc ccc ttc aat acc etc ctt gtq gac ccc etc egg ttg tec 528 

Leu Leu Pro Pro Phe Asn Thr Leu Leu val Asp Pro Leu Arg Leu ser 

165 170 175 

gtc ttt gee cca gac acc agg gga gec ata cgt tat etc tec acc ctt 576 

val Phe Ala Pro Asp Thr Arg Gly Ala lie Arg Tyr Leu Ser Thr Leu 

180 185 190 

ttg acg eta tgc cca get act tgt att eta ccc etc ggc gag ccc ttc 624 

Leu Thr Leu Cys Pro Ala Thr Cys lie Leu Pro Leu Gly Glu Pro Phe 

195 200 205 

tct cct aat gtc ccc ata tgt cgc ttt ccc egg gac tec aat gaa ccc 672 

Ser Pro Asn val Pro lie Cys Arg Phe Pro Arg Asp Ser Asn Glu Pro 

210 215 ~ 220 

ccc ctt tea gaa ttc gag ctg ccc ctt ate caa acg ccc ggc ctg tct 720 

Pro Leu Ser Glu Phe Glu Leu Pro Leu lie Gin Thr Pro Gly Leu Ser 

225 230 235 240 

tgg tct gtc ccc gcg ate gac eta ttc eta acc ggc ccc cct tec cca 768 
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warf Sequence.ST25.txt 
Trp Ser val Pro Ala lie Asp Leu Phe Leu Thr Gly Pro Pro Ser Pro 
245 250 255 

tgc gac egg tta cac gta tgg tec agt cct cag gec tta cag cgc ttc 816 
Cys Asp Arg Leu His val Trp Ser Ser Pro Gin Ala Leu Gin Arg Phe 
260 265 270 

etc cat gac cct acg eta acc tgg tea gaa ttg gtt get age agg aaa 864 
Leu His Asp Pro Thr Leu Thr Trp Ser Glu Leu Val Ala Ser Arg Lys 
275 280 285 

eta aga ctt gat tea ccc tta aaa tta caa ctg tta gaa aat gaa tgg 912 
Leu Arg Leu Asp Ser Pro Leu Lys Leu Gin Leu Leu Glu Asn Glu Trp 
290 295 300 

etc tec cgc ctt ttt tga 930 

Leu ser Arg Leu Phe 

305 

<210> 3 
<211> 309 
<212> PRT 

<213> Bovine leukemia virus 
<400> 3 

Met Ala Ser Val Val Gly Trp Gly Pro His Ser Leu His Ala Cys Pro 
1 5 10 15 

Ala Leu val Leu Ser Asn Asp val Thr lie Asp Ala Trp Cys Pro Leu 
20 25 30 

Cys Gly Pro His Glu Arg Leu Gin Phe Glu Arg lie Asp Thr Thr His 
35 40 45 

Thr Cys Glu Thr His Arg lie Thr Trp Thr Ala Asp Gly Arg Pro Phe 
50 55 60 

Gly Leu Asn Gly Ala Leu Phe Pro Arg Leu His Val Ser Arg Asp Pro 
65 70 75 80 

Ala Pro Arg Ala Arg Arg Leu Trp lie Asn Cys Pro Leu Pro Ala Val 
85 90 ' 95 

Arg Ala Gin Pro Gly Pro val Ser Leu Ser Pro Phe Glu Arg Ser Pro 
100 105 110 

Phe Gin Pro Tyr Gin Cys Gin Leu Pro Ser Ala Ser Ser Asp Gly Cys 
115 120 125 

Pro val lie Gly His Gly Leu Leu Pro Trp Asn Asn Leu val Thr His 
130 135 140 
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Warf Sequence.ST25.txt 
Pro Cys Pro Arg Lys Val Leu lie Leu Asn Gin Met Ala Asn Phe Ser 
145 150 155 160 



Leu Leu Pro Pro Phe Asn Thr Leu Leu val Asp Pro Leu Arg Leu Ser 
165 170 175 

val Phe Ala Pro Asp Thr Arg Gly Ala lie Arg Tyr Leu Ser Thr Leu 
180 185 190 

Leu Thr Leu Cys Pro Ala Thr Cys lie Leu Pro Leu Gly Glu Pro Phe 
195 200 205 

Ser Pro Asn val Pro lie Cys Arg Phe Pro Arg Asp Ser Asn Glu Pro 
210 215 220 

Pro Leu Ser Glu Phe Glu Leu Pro Leu lie Gin Thr Pro Gly Leu Ser 
225 230 235 240 

Trp Ser val Pro Ala lie Asp Leu Phe Leu Thr Gly Pro Pro Ser Pro 
245 250 255 

Cys Asp Arg Leu His Val Trp Ser Ser Pro Gin Ala Leu Gin Arg Phe 
260 265 270 



Leu His Asp Pro Thr Leu Thr Trp Ser Glu Leu val Ala Ser Arg Lys 
275 280 285 

Leu Arg Leu Asp Ser Pro Leu Lys Leu Gin Leu Leu Glu Asn Glu Trp 
290 295 300 

Leu Ser Arg Leu Phe 
305 



<210> 4 

<211> 1062 

<212> DNA 

<213> Human T-cell lymphotropi c virus type 1 



<220> 

<221> promoter 

<222> (D..C353) 

<400> 4 



atggcccact 


tcccagggtt 


tggacagagt 


cttcttttcg 


gatacccagt 


ctacgtgttt 


60 


ggagacggcg 


actggtgccc 


ctgtgtacaa 


atctctgggg 


gactatgttc 


ggcccgccta 


120 


catcgtcacg 


ccctactggc 


cacctgtcca 


gagcatcaga 


tcacctggga 


ccccatcgat 


180 


ggacgcgtta 


tcggctcagc 


tctacagttc 


cttatccctc 


gactcccctc 


cttccccacc 


240 
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cagagaacct 


ctaagaccct 


warf Sequence.ST25.txt 
caaggtcctt accccgccaa tcactcatac 


aacccccaac 


300 


attccaccct 


ccttcctcca 


ggccatgcgc 


aaatactccc 


ccttccgaaa 


tggatacatg 


360 


gaacccaccc 


ttgggcagca 


cctcccaacc 


ctgtcttttc 


cagaccccgg 


actccggccc 


420 


caaaacctgt 


acaccctctg 


gggaggctcc 


gttgtctgca 


tgtacctcta 


ccagctttcc 


480 


ccccccatca 


cctggcccct 


cctgccccac 


gtgatttttt 


gccaccccgg 


ccagctcggg 


540 


gccttcctca 


ccaatgttcc 


ctacaagcga 


atagaagaac 


tcctctataa 


aatttccctt 


600 


accacagggg 


ccctaataat 


tctacccgaa 


gactgtttgc 


ccaccaccct 


tttccagcct 


660 


gttagggcac 


ccgtcacgct 


aacagcctgg 


caaaacggcc 


tccttccgtt 


ccactcaacc 


720 


ctcaccactc 


caggccttat 


ttggacattt 


accgatggca 


cgcctatgat 


ttccgggccc 


780 


tgccctaaag 


atggccagcc 


atctttagta 


ctacagtcct 


cctcctttat 


atttcacaaa 


840 


L L L LdaaLLa 




cccctcattt 


ctactctcac 


acggcctcat 


QLCiy LCI V. l_ v_ l_ 


900 


tcctttcata 


atttacatct 


cctgtttgaa 


gaatacacca 


acatccccat 


ttctctactt 


960 


tttaacaaaa 


aagaggcaga 


tgacaatgac 


catgagcccc 


aaatatcccc 


cgggggctta 


1020 


gagcctccca 


gtgaaaaaca 


tttccgcgaa 


acagaagtct 


ga 




1062 


<210> 5 
<211> 353 
<212> DNA 

<213> Human T-cell lymphotropi c 


vi rus type 


i 






<400> 5 
tgacaatgac 


catgagcccc 


aaatatcccc 


cgggggctta 


gagcctctca 


gtgaaaaaca 


60 


tttccgtgaa 


acagaagtct 


gagaaggtca 


gggcccagaa 


taaggctctg 


acgtctcccc 


120 


ccggaggaca 


gctcagcacc 


agctcaggct 


aggccctgac 


gtgtccccct 


aaagacaaat 


180 


cataagctca 


gacctccggg 


aagccaccgg 


gaaccaccca 


tttcctcccc 


atgtttgtca 


240 


agccgtcctc 


aggcgttgac 


gacaacccct 


cacctcaaaa 


aacttttcat 


ggcacgcata 


300 


cggctcaata 


aaataacagg 


agtctataaa 


agcgtgggga 


cagttcagga 


ggg 


353 



<210> 6 

<211> 456 

<212> DNA 

<213> Human immunodeficiency virus type 1 



<220> 

<221> CDS 

<222> (1)..(456) 

<400> 6 

ctg gaa ggg eta att tgg tec caa aga aga caa gag ate ctt gat ctg 48 

Leu Glu Gly Leu lie Trp ser Gin Arg Arg Gin Glu lie Leu Asp Leu 
15 10 15 
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tgg ate tac cac aca caa ggc tac ttc cct gat tgg cag aat tac aca 96 

Trp lie Tyr His Thr Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr 

20 25 30 

cca ggg cca ggg ate aga tat cca ctg acc ttt gqa tgg tgc ttc aag 144 

Pro Gly Pro Gly lie Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys 

35 40 45 

eta gta cca gtt gag cca gag aag gta gaa gag gec aat gaa gqa gag 192 

Leu val Pro val Glu Pro Glu Lys val Glu Glu Ala Asn Glu Gly Glu 

50 55 60 

aac aac age ttg tta cac cct atg age ctg cat gqg atg gag gac gcg 240 

Asn Asn Ser Leu Leu His Pro Met Ser Leu His Gly Met Glu Asp Ala 

65 70 75 80 

gag aaa gaa gtg tta gtg tgg agg ttt gac age aaa eta gca ttt cat 288 

Glu Lys Glu val Leu val Trp Arg Phe Asp Ser Lys Leu Ala Phe His 

85 ~ 90 95 

cac atg gee cga gag ctg cat ccg gag tac tac aaa gac tgc tga cat 336 

His Met Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys His 

100 105 ' 110 



cga get ttc tac aag gqa ctt tec get gqg gac ttt cca gqg agg cgt 

Arg Ala Phe Tyr Lys Gly Leu Ser Ala Gly Asp Phe Pro Gly Arg Arg 

115 120 125 

ggc ctg ggc ggg act gqg gag tgg cgt ccc tea gat get gca tat aag 

Gly Leu Gly Gly Thr Gly Glu Trp Arg Pro Ser Asp Ala Ala Tyr Lys 

130 135 140 



384 



432 



cag ctg ctt ttt gee tgt act gqg 456 
Gin Leu Leu Phe Ala Cys Thr Gly 
145 150 

<210> 7 

<211> 110 

<212> PRT 

<213> Human immunodeficiency virus type 1 

<400> 7 

Leu Glu Gly Leu lie Trp Ser Gin Arg Arg Gin Glu lie Leu Asp Leu 
1 5 10 15 

Trp lie Tyr His Thr Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr 
20 25 30 

Pro Gly Pro Gly lie Arg Tyr Pro Leu Thr Phe Gly Trp Cys Phe Lys 
35 40 45 

Leu Val Pro Val Glu Pro Glu Lys val Glu Glu Ala Asn Glu Gly Glu 
50 55 60 

Asn Asn Ser Leu Leu His Pro Met ser Leu His Gly Met Glu Asp Ala 
65 70 75 80 
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Glu Lys Glu val Leu val Trp Arg Phe Asp Ser Lys Leu Ala Phe His 
85 90 95 

His Met Ala Arg Glu Leu His Pro Glu Tyr Tyr Lys Asp Cys 
100 105 * 110 

<210> 8 
<211> 41 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 8 

His Arg Ala Phe Tyr Lys Gly Leu Ser Ala Gly Asp Phe Pro Gly Arg 
1 5 10 15 

Arg Gly Leu Gly Gly Thr Gly Glu Trp Arg Pro Ser Asp Ala Ala Tyr 
20 25 30 

Lys Gin Leu Leu Phe Ala Cys Thr Gly 
35 40 

<210> 9 
<211> 306 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<220> 

<221> CDS 

<222> (1). .(303) 

<400> 9 

atg gag cca gta gat cct aat eta gag ccc tgg aag cat cca gga agt 48 

Met Glu Pro Val Asp Pro Asn Leu Glu Pro Trp Lys His Pro Gly Ser 
1 5 10 15 

cag cct agg act get tgt aac aat tgc tat tgt aaa aag tgt tgc ttt 96 
Gin Pro Arg Thr Ala Cys Asn Asn Cys Tyr Cys Lys Lys Cys Cys Phe 
20 25 30 



cat tgc tac gcg tgt ttc aca aga aaa gqc tta gqc ate tec tat gqc 
His Cys Tyr Ala Cys Phe Thr Arg Lys Gly Leu Gly lie Ser Tyr Gly 
35 40 45 



144 



agg aag aag egg aga cag cga cga aga get cct cag gac agt cag act 192 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 

50 55 60 

cat caa get tct eta tea aag caa ccc gee tec cag tec cga ggg gac 240 

His Gin Ala Ser Leu Ser Lys Gin Pro Ala Ser Gin Ser Arg Gly Asp 

65 70 75 80 

ccg aca gqc ccg acg gaa teg aag aag aag gtq gag aga gag aca gag 288 

Pro Thr Gly Pro Thr Glu Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 

85 90 " 95 
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aca gat ccg ttc gat tag 
Thr Asp Pro Phe Asp 
100 



warf Sequence.ST25.txt 



306 



<210> 10 

<211> 101 

<212> PRT 

<213> Human immunodeficiency virus type 1 

<400> 10 

Met Glu Pro Val Asp Pro Asn Leu Glu Pro Trp Lys His Pro Gly Ser 
1 5 10 15 

Gin Pro Arg Thr Ala cys Asn Asn cys Tyr Cys Lys Lys Cys Cys Phe 
20 25 30 

His Cys Tyr Ala Cys Phe Thr Arg Lys Gly Leu Gly lie Ser Tyr Gly 
35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 
50 55 ~ 60 

His Gin Ala Ser Leu Ser Lys Gin Pro Ala Ser Gin Ser Arg Gly Asp 
65 70 75 ~ 80 

Pro Thr Gly Pro Thr Glu Ser Lys Lys Lys val Glu Arg Glu Thr Glu 
85 90 95 



Thr Asp Pro Phe Asp 
100 



<210> 11 

<211> 309 

<212> PRT 

<213> Bovine leukemia virus 

<400> 11 

Met Ala ser val val Gly Trp Gly Pro His ser Leu His Ala Cys Pro 
1 5 10 15 

Ala Leu Val Leu Ser Asn Asp val Thr lie Asp Ala Trp Cys Pro Leu 
20 25 30 

Cys Gly Pro His Glu Arg Leu Gin Phe Glu Arg lie Asp Thr Thr His 
35 40 45 

Thr Cys Glu Thr His Arg lie Thr Trp Thr Ala Asp Gly Arg Pro Phe 
50 55 60 
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warf 5equence.ST25.txt 
Gly Leu Asn Gly Ala Leu Phe Pro Arg Leu His val Ser Arg Asp Pro 
65 70 75 80 

Ala Pro Arg Ala Arg Arg Leu Trp lie Asn Cys Pro Leu Pro Ala val 
85 90 95 

Arg Ala Gin Pro Gly Pro Val Ser Leu Ser Pro Phe Glu Arg Ser Pro 
100 105 110 

Phe Gin Pro Tyr Gin Cys Gin Leu Pro Ser Ala Ser Ser Asp Gly Cys 
115 120 125 

Pro Val lie Gly His Gly Leu Leu Pro Trp Asn Asn Leu Val Thr His 
130 ' 135 140 

Pro Cys Pro Arg Lys val Leu lie Leu Asn Gin Met Ala Asn Phe Ser 
145 150 155 160 

Leu Leu Pro Pro Phe Asn Thr Leu Leu Val Asp Pro Leu Arg Leu Ser 
165 170 175 

val Phe Ala Pro Asp Thr Arg Gly Ala lie Arg Tyr Leu Ser Thr Leu 
180 ~ 185 190 

Leu Thr Leu Cys Pro Ala Thr Cys lie Leu Pro Leu Gly Glu Pro Phe 
195 200 205 

Ser Pro Asn val Pro lie Cys Arg Phe Pro Arg Asp Ser Asn Glu Pro 
210 215 220 

Pro Leu Ser Glu Phe Glu Leu Pro Leu lie Gin Thr Pro Gly Leu Ser 
225 230 235 240 

Trp Ser val Pro Ala lie Asp Leu Phe Leu Thr Gly Pro Pro Ser Pro 
245 250 255 

Cys Asp Arg Leu His Val Trp Ser Ser Pro Gin Ala Leu Gin Arg Phe 
260 265 270 

Leu His Asp Pro Thr Leu Thr Trp Ser Glu Leu Val Ala Ser Arg Lys 
275 280 285 

Leu Arg Leu Asp Ser Pro Leu Lys Leu Gin Leu Leu Glu Asn Glu Trp 
290 295 300 

Leu Ser Arg Leu Phe 
305 
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<210> 12 
<211> 353 
<212> PRT 

<213> Human T-cell lymphotropi c virus type 1 
<400> 12 

Met Ala His Phe Pro Gly Phe Gly Gin Ser Leu Leu Phe Gly Tyr Pro 
1 5 10 15 

val Tyr Val Phe Gly Asp Cys val Gin Gly Asp Trp Cys Pro lie Ser 
20 25 30 

Gly Gly Leu Cys Ser Ala Arg Leu His Arg His Ala Leu Leu Ala Thr 
35 40 45 

Cys Pro Glu His Gin lie Thr Trp Asp Pro lie Asp Gly Arg Val lie 
50 55 60 

Gly ser Ala Leu Gin Phe Leu lie Pro Arg Leu Pro Ser Phe Pro Thr 
65 70 75 80 

Gin Arg Thr Ser Lys Thr Leu Lys Val Leu Thr Pro Pro lie Thr His 
85 90 95 

Thr Thr Pro Asn lie Pro Pro Ser Phe Leu Gin Ala Met Arg Lys Tyr 
100 105 110 

Ser Pro Phe Arg Asn Gly Tyr Met Glu Pro Thr Leu Gly Gin His Leu 
115 120 125 

Pro Thr Leu Ser Phe Pro Asp Pro Gly Leu Arg Pro Gin Asn Leu Tyr 
130 135 140 

Thr Leu Trp Gly Gly Ser val val Cys Met Tyr Leu Tyr Gin Leu Ser 
145 150 155 * 160 

Pro Pro lie Thr Trp Pro Leu Leu Pro His val lie Phe Cys His Pro 
165 170 175 

Gly Gin Leu Gly Ala Phe Leu Thr Asn val Pro Tyr Lys Arg lie Glu 
180 185 190 

Glu Leu Leu Tyr Lys lie Ser Leu Thr Thr Gly Ala Leu lie lie Leu 
195 200 205 

Pro Glu Asp Cys Leu Pro Thr Thr Leu Phe Gin Pro val Arg Ala Pro 
210 215 220 
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Val Thr Leu Thr Ala Trp Gin Asn Gly Leu Leu Pro Phe His Ser Thr 
225 230 235 240 

Leu Thr Thr Pro Gly Leu lie Trp Thr Phe Thr Asp Gly Thr Pro Met 
245 250 255 

lie Ser Gly Pro Cys Pro Lys Asp Gly Gin Pro Ser Leu Val Leu Gin 
260 265 270 

Ser Ser Ser Phe lie Phe His Lys Phe Gin Thr Lys Ala Tyr His Pro 
275 280 285 

Ser Phe Leu Leu Ser His Gly Leu lie Gin Tyr Ser Ser Phe His Asn 
290 295 300 

Leu His Leu Leu Phe Glu Glu Tyr Thr Asn lie Pro lie Ser Leu Leu 
305 310 315 320 

Phe Asn Lys Lys Glu Ala Asp Asp Asn Asp His Glu Pro Gin lie Ser 
325 330 335 

Pro Gly Gly Leu Glu Pro Pro Ser Glu Lys His Phe Arg Glu Thr Glu 
340 345 350 

val 



<210> 13 

<211> 101 

<212> PRT 

<213> Human T-cell lymphotropic virus type 1 

<400> 13 

Met Glu Pro val Asp Pro Asn Leu Glu Pro Trp Lys His Pro Gly Ser 
1 5 10 15 

Gin Pro Arg Thr Ala Cys Asn Asn Cys Tyr Cys Lys Lys Cys Cys Phe 
20 25 30 

His Cys Tyr Ala Cys Phe Thr Arg Lys Gly Leu Gly lie Ser Tyr Gly 
35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro Gin Asp Ser Gin Thr 
50 ~ " 55 60 

His Gin Ala Ser Leu ser Lys Gin Pro Ala Ser Gin ser Arg Gly Asp 
65 70 75 80 
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Pro Thr Gly Pro Thr Glu Ser Lys Lys Lys Val Glu Arg Glu Thr Glu 
85 90 95 



Thr Asp Pro Phe Asp 
100 



<210> 14 

<211> 7685 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered Sequence from virus and plasmid 



<220> 

<221> LTR 

<222> (149). .(737) 

<223> 5' MOMUSVLTR 

<220> 

<221> CDS 

<222> (1753).. (2148) 

<223> Blasticidin Resistance 

<220> 

<221> promoter 

<222> (2257).. (3074) 

<223> CMV IE promoter 

<220> 

<221> misc_recomb 

<222> (3078).. (3102) 

<223> attBl 

<220> 

<221> CDS 

<222> (3115).. (4041) 

<223> BLV Tax 

<220> 

<221> misc_recomb 

<222> (4046) .. (4070) 

<223> attB2 

<220> 

<221> misc_signal 

<222> (4082). .(4674) 

<223> wpre; woodchuck hepatitis virus post-transcriptional regulatory 
element 

<220> 

<221> LTR 

<222> (4720).. (5313) 

<223> 3' MoMuLvLTR 

<220> 

<221> CDS 

<222> (6616).. (7476) 

<223> Ampicillin Resistance 
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<400> 14 
gaattaattc 


ataccagatc 


accgaaaact 


gtcctccaaa 


tgtgtccccc 


tcacactccc 


60 


aaattcgcgg 


gcttctgcct 


cttagaccac 


tctaccctat 


tccccacact 


caccggagcc 


120 


aaagccgcgg 


cccttccgtt 


tctttgcttt 


tgaaagaccc 


cacccgtagg 


tggcaagcta 


180 


gcttaagtaa 


cgccactttg 


caaggcatgg 


aaaaatacat 


aactgagaat 


agaaaagttc 


240 


agatcaaggt 


caggaacaaa 


gaaacagctg 


aataccaaac 


aggatatctg 


tggtaagcgg 


300 


ttcctgcccc 


ggctcagggc 


caagaacaga 


tgagacagct 


gagtgatggg 


ccaaacagga 


360 


tatctgtggt 


aagcagttcc 


tgccccggct 


cggggccaag 


aacagatggt 


ccccagatgc 


420 


ggtccagccc 


tcagcagttt 


ctagtgaatc 


atcagatgtt 


tccagggtgc 


cccaaggacc 


480 


tgaaaatgac 


cctgtacctt 


atttgaacta 


accaatcagt 


tcgcttctcg 


cttctgttcg 


540 


cgcgcttccg 


ctctccgagc 


tcaataaaag 


agcccacaac 


ccctcactcg 


gcgcgccagt 


600 


cttccgatag 


actgcgtcgc 


ccgggtaccc 


gtattcccaa 


taaagcctct 


tgctgtttgc 


660 


atccgaatcg 


tggtctcgct 


gttccttggg 


agggtctcct 


ctgagtgatt 


gactacccac 


720 


gacgggggtc 


tttcatttgg 


gggctcgtcc 


gggatttgga 


gacccctgcc 


cagggaccac 


780 


cgacccacca 


ccgggaggta 


agctggccag 


caacttatct 


gtgtctgtcc 


gattgtctag 


840 


tgtctatgtt 


tgatgttatg 


cgcctgcgtc 


tgtactagtt 


agctaactag 


ctctgtatct 


900 


ggcggacccg 


tggtggaact 


gacgagttct 


gaacacccgg 


ccgcaaccct 


gggagacgtc 


960 


ccagggactt 


tgggggccgt 


ttttgtggcc 


cgacctgagg 


aagggagtcg 


atgtggaatc 


1020 


cgaccccgtc 


aggatatgtg 


gttctggtag 


gagacgagaa 


cctaaaacag 


ttcccgcctc 


1080 


cgtctgaatt 


tttgctttcg 


gtttggaacc 


gaagccgcgc 


gtcttgtctg 


ctgcagcgct 


1140 


gcagcatcgt 


tctgtgttgt 


ctctgtctga 


ctgtgtttct 


gtatttgtct 


gaaaattagg 


1200 


gccagactgt 


taccactccc 


ttaagtttga 


ccttaggtca 


ctggaaagat 


gtcgagcgga 


1260 


tcgctcacaa 


ccagtcggta 


gatgtcaaga 


agagacgttg 


ggttaccttc 


tgctctgcag 


1320 


aatggccaac 


ctttaacgtc 


ggatggccgc 


gagacggcac 


ctttaaccga 


gacctcatca 


1380 


cccaggttaa 


gatcaaggtc 


ttttcacctg 


gcccgcatgg 


acacccagac 


caggtcccct 


1440 


acatcgtgac 


ctgggaagcc 


ttggcttttg 


acccccctcc 


ctgggtcaag 


ccctttgtac 


1500 


accctaagcc 


tccgcctcct 


cttcctccat 


ccgccccgtc 


tctccccctt 


gaacctcctc 


1560 


gttcgacccc 


gcctcgatcc tccctttatc 


cagccctcac 


tccttctcta 


ggcgccggaa 


1620 


ttccgatctg 


atcaagagac 


aggatgaggg 


agcttgtata 


tccattttcg 


gatctgatca 


1680 


gcacgtgttg 


acaattaatc 


atcggcatag 


tatatcggca 


tagtataata 


cgacaaggtg 


1740 



aggaactaaa cc atg gcc aag cct ttg tct caa gaa gaa tec acc etc att 1791 

Met Ala Lys Pro Leu Ser Gin Glu Glu Ser Thr Leu lie 

1 5 10 
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gaa aga gca acg get aca ate aac age ate ccc ate tct gaa gac tac 1839 

Glu Arg Ala Thr Ala Thr lie Asn ser lie Pro lie Ser Glu Asp Tyr 

15 20 25 

age gtc gee age gca get etc tct age gac gqc cgc ate ttc act ggt 1887 

Ser val Ala Ser Ala Ala Leu Ser Ser Asp Gly Arg lie Phe Thr Gly 

30 35 40 " 45 

gtc aat gta tat cat ttt act ggg gqa cct tgt gca gaa etc gtq gtq 1935 

val Asn val Tyr His Phe Thr Gly Gly Pro Cys Ala Glu Leu val val 

50 55 60 

ctg gqc act get get get gcg gca get ggc aac ctg act tgt ate gtc 1983 

Leu Gly Thr Ala Ala Ala Ala Ala Ala Gly Asn Leu Thr Cys lie Val 

65 70 75 

gcg ate gqa aat gag aac agg gqc ate ttg age ccc tgc gqa egg tgt 2031 

Ala lie Gly Asn Glu Asn Arg Gly lie Leu Ser Pro Cys Gly Arg Cys 

80 85 90 

cga cag gtq ctt etc gat ctg cat cct ggg ate aaa gcg ata gtg aag 2079 

Arg Gin Val Leu Leu Asp Leu His Pro Gly lie Lys Ala lie Val Lys 

95 100 105 

gac agt gat gqa cag ccg acg gca gtt ggg att cgt gaa ttg ctg ccc 2127 

Asp Ser Asp Gly Gin Pro Thr Ala val Gly lie Arg Glu Leu Leu Pro 

110 115 120 125 



tct ggt tat gtg tgg gag gqc taagcacttc gtggccgagg agcaggactg 2178 
Ser Gly Tyr val Trp Glu Gly 
130 



acaegtgeta 


cgagatttcg 


attccaccgc 


cgccttctat 


gaaaggttgg 


getteggaat 


2238 


cgttttccgg 


gacgccgatc 


eggecattag 


ccatattatt 


cattggttat 


atagcataaa 


2298 


tcaatattgg 


etattggeca 


ttgcatacgt 


tgtatccata 


tcataatatg 


tacatttata 


2358 


ttggctcatg 


tccaacatta 


ccgccatgtt 


gacattgatt 


attgactagt 


tattaatagt 


2418 


aatcaattac 


ggggtcatta 


gttcatagee 


catatatgga 


gttccgcgtt 


acataactta 


2478 


eggtaaatgg 


cccgcctggc 


tgaccgccca 


acgacccccg 


cccattgacg 


tcaataatga 


2538 


cgtatgttcc 


catagtaacg 


ccaataggga 


ctttccattg 


aegtcaatgg 


gtggagtatt 


2598 


tacggtaaac 


tgcccacttg 


gcagtacatc 


aagtgtatca 


tatgecaagt 


acgcccccta 


2658 


ttgacgtcaa 


tgacggtaaa 


tggcccgcct 


ggcattatgc 


ccagtacatg 


accttatggg 


2718 


actttcctac 


ttggcagtac 


atetaegtat 


tagtcatege 


tattaccatg 


gtgatgcggt 


2778 


tttggcagta 


catcaatggg 


cgtggatagc 


ggtttgactc 


aeggggattt 


ccaagtctcc 


2838 


accccattga 


cgtcaatggg 


agtttgtttt 


ggcaccaaaa 


teaaegggae 


tttccaaaat 


2898 


gtcgtaacaa 


ctccgcccca 


ttgacgcaaa 


tgggeggtag 


geatgtaegg 


tgggaggtct 


2958 


atataagcag 


agetegttta 


gtgaaccgtc 


agatcgectg 


gagacgecat 


ccacgctgtt 


3018 


ttgacctcca 


tagaagacac 


egggaccgat 


ccagcctccg 


cggccccaag 


cttgttatca 


3078 
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caagtttgta caaaaaagca ggctcccgcc gccacc atg gca agt gtt gtt gqt 3132 

Met Ala Ser val val Gly 
135 

tgg ggg ccc cac tct eta cat gec tgc ccg gec ctg gtt ttg tec aat 3180 
Trp Gly Pro His Ser Leu His Ala Cys Pro Ala Leu val Leu ser Asn 
140 145 150 

gat gtc acc ate gat gec tgg tgc ccc etc tgc ggg ccc cat gag cga 3228 
Asp Val Thr lie Asp Ala Trp Cys Pro Leu Cys Gly Pro His Glu Arg 
155 160 165 170 

etc caa ttc gaa agg ate gac acc acg etc acc tgc gag acc cac cgt 3276 
Leu Gin Phe Glu Arg lie Asp Thr Thr Leu Thr Cys Glu Thr His Arg 
175 180 185 

ate aac tgg acc gee gat gqa cga cct tgc gqc etc aat gqa acg ttg 3324 
lie Asn Trp Thr Ala Asp Gly Arg Pro Cys Gly Leu Asn Gly Thr Leu 
190 195 200 

ttc cct cga ctg cat gtc tec gag acc cgc ccc caa ggg ccc cga cga 3372 
Phe Pro Arg Leu His val Ser Glu Thr Arg Pro Gin Gly Pro Arg Arg 
205 210 215 

etc tgg ate aac tgc ccc ctt ccg gee gtt cgc get cag ccc gqc ccg 3420 
Leu Trp lie Asn Cys Pro Leu Pro Ala val Arg Ala Gin Pro Gly Pro 
220 225 230 

gtt tea ctt tec ccc ttc gag egg tec ccc ttc cag ccc tac caa tgc 3468 
val Ser Leu Ser Pro Phe Glu Arg Ser Pro Phe Gin Pro Tyr Gin Cys 
235 240 245 ' 250 

caa ttg ccc teg gec tct age gac gqt tgc ccc att ate ggg cac gqc 3516 
Gin Leu Pro Ser Ala Ser Ser Asp Gly Cys Pro lie lie Gly His Gly 
255 260 265 

ctt ctt ccc tgg aac aac tta gta acg cat cct gtc etc aga aaa gtc 3564 
Leu Leu Pro Trp Asn Asn Leu val Thr His Pro Val Leu Arg Lys Val 
270 275 280 

ctt ata tta aat caa atg gee aat ttt tec tta etc ccc tec ttc gat 3612 
Leu lie Leu Asn Gin Met Ala Asn Phe Ser Leu Leu Pro Ser Phe Asp 
285 290 295 

acc etc ctt gtg gac ccc etc egg ctg tec gtc ttt gee cca gac acc 3660 
Thr Leu Leu val Asp Pro Leu Arg Leu Ser val Phe Ala Pro Asp Thr 
300 305 " 310 

agg gqa gee ata cgt tat etc tec acc ctt ttg acg eta tgc ccg get 3708 
Arg Gly Ala lie Arg Tyr Leu Ser Thr Leu Leu Thr Leu Cys Pro Ala 
315 320 325 330 

act tgt att eta ccc eta gqc gag ccc ttc tct cct aat gtc ccc ata 3756 
Thr Cys lie Leu Pro Leu Gly Glu Pro Phe Ser Pro Asn val Pro lie 
335 340 345 

tgc cgc ttt ccc egg gac tec aat gaa ccc ccc ctt tea gaa ttc gag 3804 
Cys Arg Phe Pro Arg Asp ser Asn Glu Pro Pro Leu Ser Glu Phe Glu 
350 355 360 

ctg ccc ctt ate caa acg ccc gqc ctg tct tgg tct gtc ccc gcg ate 3852 
Leu Pro Leu lie Gin Thr pro Gly Leu Ser Trp Ser Val Pro Ala lie 
365 370 375 
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gac eta ttc eta acc ggt ccc cct tec cca tgc gac egg tta cac gta 3900 

Asp Leu Phe Leu Thr Gly Pro Pro Ser Pro Cys Asp Arg Leu His val 

380 385 390 

tgg tec agt cct cag gec tta cag cgc ttc ctt cat gac cct acg eta 3948 

Trp Ser Ser Pro Gin Ala Leu Gin Arg Phe Leu His Asp Pro Thr Leu 

395 400 ~ 405 410 

acc tgg tec gaa tta gtt get age aga aaa ata aga ctt gat tec ccc 3996 

Thr Trp Ser Glu Leu Val Ala Ser Arg Lys lie Arg Leu Asp Ser Pro 

415 420 ~ 425 

tta aaa tta caa ctg eta gaa aat gaa tgg etc tec cgc ctt ttt 4041 

Leu Lys Leu Gin Leu Leu Glu Asn Glu Trp Leu Ser Arg Leu Phe 

430 435 440 



tgagacccag 


ctttcttgta 


caaagtggtg 


ataacatcga 


taatcaacct 


ctggattaca 


4101 


aaatttgtga 


aagattgact 


ggtattctta 


actatgttgc 


tccttttacg 


ctatgtggat 


4161 


aegctgettt 


aatgcctttg 


tatcatgeta 


ttgcttcccg 


tatggctttc 


attttctcct 


4221 


ccttgtataa 


atcctggttg 


ctgtctcttt 


atgaggagtt 


gtggcccgtt 


gtcaggcaac 


4281 


gtggcgtggt 


gtgcactgtg 


tttgctgacg 


caacccccac 


tggttggggc 


attgccacca 


4341 


cctgtcagct 


cctttccggg 


actttcgett 


tccccctccc 


tattgecacg 


geggaactea 


4401 


tcgccgcctg 


ccttgcccgc 


tgctggacag 


gggctegget 


gttgggcact 


gacaattccg 


4461 


tggtgttgtc 


ggggaaatca 


tcgtcctttc 


ettggctget 


cgcctgtgtt 


gccacctgga 


4521 


ttctgegegg 


gacgtccttc 


tgctacgtcc 


cttcggccct 


caatccagcg 


gaccttcctt 


4581 


cccgcggcct 


gctgccggct 


ctgcggcctc 


ttccgcgtct 


tcgccttcgc 


cctcagacga 


4641 


gteggatetc 


cctttgggcc 


gcctccccgc 


ctgatcgata 


aaataaaaga 


ttttatttag 


4701 


tctccagaaa 


aaggggggaa 


tgaaagaccc 


cacctgtagg 


tttggcaagc 


tagcttaagt 


4761 


aacgecattt 


tgcaaggcat 


ggaaaaatac 


ataactgaga 


atagagaagt 


tcagatcaag 


4821 


gtcaggaaca 


gatggaacag 


ctgaatatgg 


gecaaacagg 


atatctgtgg 


taagcagttc 


4881 


ctgccccggc 


teagggecaa 


gaacagatgg 


aacagctgaa 


tatgggccaa 


acaggatatc 


4941 


tgtggtaagc 


agttcctgcc 


ccggctcagg 


gecaagaaca 


gatggtcccc 


agatgeggtc 


5001 


cagccctcag 


cagtttctag 


agaaccatca 


gatgtttcca 


gggtgcccca 


aggacctgaa 


5061 


atgaccctgt 


gecttatttg 


aactaaccaa 


teagttcget 


tctcgcttct 


gttcgcgcgc 


5121 


ttctgctccc 


cgagctcaat 


aaaagagece 


acaacccctc 


acteggggeg 


ccagtcctcc 


5181 


gattgactga 


gtcgcccggg 


tacccgtgta 


tccaataaac 


cctcttgcag 


ttgeatcega 


5241 


cttgtggtct 


cgctgttcct 


tgggagggtc 


tcctctgagt 


gattgactac 


ccgtcagcgg 


5301 


gggtctttca 


tttttccatt 


gggggctcgt 


cegggategg 


gagacccctg 


cccagggacc 


5361 


accgacccac 


cacegggagg 


taagctggct 


gcctcgcgcg 


tttcggtgat 


gacggtgaaa 


5421 
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acctctgaca 


catgcagctc 


ccggagacgg 


tcacagcttg 


tctgtaagcg 


gatgccggga 


5481 


gcagacaagc 


ccgtcagggc 


gcgtcagcgg 


gtgttggcgg 


gtgtcggggc 


gcagccatga 


5541 


cccagtcacg 


tagcgatagc 


ggagtgtata 


ctggcttaac 


tatgcggcat 


cagagcagat 


5601 


tgtactgaga 


gtgcaccata 


tgcggtgtga 


aataccgcac 


agatgcgtaa 


ggagaaaata 


5661 


ccgcatcagg 


cgctcttccg 


cttcctcgct 


cactgactcg 


ctgcgctcgg 


tcgttcggct 


5721 


gcggcgagcg 


gtatcagctc 


actcaaaggc 


ggtaatacgg 


ttatccacag 


aatcagggga 


5781 


taacgcagga 


aagaacatgt 


gagcaaaagg 


ccagcaaaag 


gccaggaacc 


gtaaaaaggc 


5841 


cgcgttgctg 


gcgtttttcc 


ataggctccg 


cccccctgac 


gagcatcaca 


aaaatcgacg 


5901 


ctcaagtcag 


aggtggcgaa 


acccgacagg 


actataaaga 


taccaggcgt 


ttccccctgg 


5961 


aagctccctc 


gtgcgctctc 


ctgttccgac 


cctgccgctt 


accggatacc 


tgtccgcctt 


6021 


tctcccttcg 


ggaagcgtgg 


cgctttctca 


tagctcacgc 


tgtaggtatc 


tcagttcggt 


6081 


gtaggtcgtt 


cgctccaagc 


tgggctgtgt 


gcacgaaccc 


cccgttcagc 


ccgaccgctg 


6141 


cgccttatcc 


ggtaactatc 


gtcttgagtc 


caacccggta 


agacacgact 


tatcgccact 


6201 


ggcagcagcc 


actggtaaca 


ggattagcag 


agcgaggtat 


gtaggcggtg 


ctacagagtt 


6261 


cttgaagtgg 


tggcctaact 


acggctacac 


tagaaggaca 


gtatttggta 


tctgcgctct 


6321 


nr^na n r~\ f~ f ^ 
yCLyadyLLd. 


tj L LdLLt icy 


ydddddydy l 


tnntanrtrl" 
Ly y LdyL LLL 


LydLLLyyt-d 




OJOl 


cgctggtagc 


ggtggttttt 


ttgtttgcaa 


gcagcagatt 


acgcgcagaa 


aaaaaggatc 


6441 


tcaagaagat 


cctttgatct 


tttctacggg 


gtctgacgct 


cagtggaacg 


aaaactcacg 


6501 


ttaagggatt 


ttggtcatga 


gattatcaaa 


aaggatcttc 


acctagatcc 


ttttaaatta 


6561 


aaaatgaagt 


tttaaatcaa 


tctaaagtat 


atatgagtaa 


acttggtctg 


acag tta 


6618 



Leu 

cca atg ctt aat cag tga ggc acc tat etc age gat ctg tct att teg 6666 

Pro Met Leu Asn Gin Gly Thr Tyr Leu Ser Asp Leu Ser lie Ser 

445 450 455 

ttc ate cat agt tgc ctg act ccc cgt cgt gta gat aac tac gat acg 6714 

Phe lie His Ser Cys Leu Thr Pro Arg Arg val Asp Asn Tyr Asp Thr 

460 ' 465 ~ 470 

gga gqg ctt acc ate tgg ccc cag tgc tgc aat gat acc gcg aga ccc 6762 

Gly Gly Leu Thr lie Trp Pro Gin Cys Cys Asn Asp Thr Ala Arg Pro 

475 480 485 

acg etc acc ggc tec aga ttt ate age aat aaa cca gee age egg aag 6810 

Thr Leu Thr Gly Ser Arg Phe lie Ser Asn Lys Pro Ala Ser Arg Lys 

490 495 500 505 

ggc cga gcg cag aag tgg tec tgc aac ttt ate cgc etc cat cca gtc 6858 

Gly Arg Ala Gin Lys Trp Ser cys Asn Phe lie Arg Leu His Pro val 

510 515 ~ 520 

tat taa ttg ttg ccg gga age tag agt aag tag ttc gec agt taa tag 6906 

Page 17 



warf Sequence.ST25.txt 

Tyr Leu Leu Pro Gly Ser Ser Lys Phe Ala Ser 

525 530 

ttt gcg caa cgt tgt tgc cat tgc tgc agg cat cgt gqt gtc acg etc 6954 

Phe Ala Gin Arg Cys Cys His Cys Cys Arg His Arg Gly val Thr Leu 

535 ~ 540 545 

gtc gtt tgg tat gqc ttc att cag etc egg ttc cca acg ate aag gcg 7002 

val Val Trp Tyr Gly Phe lie Gin Leu Arg Phe Pro Thr lie Lys Ala 

550 555 560 

agt tac atg ate ccc cat gtt gtq caa aaa age gqt tag etc ctt egg 7050 

Ser Tyr Met lie Pro His Val Val Gin Lys Ser Gly Leu Leu Arg 

565 570 575 

tec tec gat cgt tgt cag aag taa gtt gqc cgc agt gtt ate act cat 7098 

Ser Ser Asp Arg Cys Gin Lys Val Gly Arg Ser Val lie Thr His 

580 585 590 

gqt tat gqc age act gca taa ttc tct tac tgt cat gee ate cgt aag 7146 

Gly Tyr Gly Ser Thr Ala Phe Ser Tyr Cys His Ala lie Arg Lys 

595 600 605 

atg ctt ttc tgt gac tgg tga gta etc aac caa gtc att ctg aga ata 7194 

Met Leu Phe Cys Asp Trp Val Leu Asn Gin val lie Leu Arg lie 

610 " 615 620 



gtq tat gcg gcg acc gag ttg etc ttg ccc gqc gtc aac acg gqa taa 
val Tyr Ala Ala Thr Glu Leu Leu Leu Pro Gly val Asn Thr Gly 
625 630 635 



7242 



tac cgc gee aca tag cag aac ttt aaa agt get cat cat tgg aaa acg 7290 

Tyr Arg Ala Thr Gin Asn Phe Lys Ser Ala His His Trp Lys Thr 

640 645 650 

ttc ttc gqg gcg aaa act etc aag gat ctt acc get gtt gag ate cag 7338 

Phe Phe Gly Ala Lys Thr Leu Lys Asp Leu Thr Ala val Glu lie Gin 
655 660 665 670 

ttc gat gta acc cac teg tgc acc caa ctg ate ttc age ate ttt tac 7386 

Phe Asp val Thr His Ser Cys Thr Gin Leu lie Phe Ser lie Phe Tyr 
675 ' 680 685 

ttt cac cag cgt ttc tgg gtq age aaa aac agg aag gca aaa tgc cgc 7434 

Phe His Gin Arg Phe Trp val Ser Lys Asn Arg Lys Ala Lys Cys Arg 
690 695 700 

aaa aaa gqg aat aag gqc gac acg gaa atg ttg aat act cat 7476 

Lys Lys Gly Asn Lys Gly Asp Thr Glu Met Leu Asn Thr His 
705 710 715 

actcttcctt tttcaatatt attgaagcat ttatcagggt tattgtctca tgageggata 7536 

catatttgaa tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa 7596 

agtgccacct gaegtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg 7656 

tatcacgagg ccctttcgtc ttcaagaat 7685 



<210> 15 
<211> 132 
<212> PRT 
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<213> Artificial 
<220> 

<223> Synthetic Construct 
<400> 15 

Met Ala Lys Pro Leu Ser Gin Glu Glu Ser Thr Leu lie Glu Arg Ala 
1 5 10 15 

Thr Ala Thr lie Asn Ser lie Pro lie Ser Glu Asp Tyr ser val Ala 
20 25 30 

Ser Ala Ala Leu Ser Ser Asp Gly Arg lie Phe Thr Gly Val Asn Val 
35 40 45 

Tyr His Phe Thr Gly Gly Pro Cys Ala Glu Leu val val Leu Gly Thr 
50 55 60 

Ala Ala Ala Ala Ala Ala Gly Asn Leu Thr Cys lie val Ala lie Gly 
65 70 75 80 

Asn Glu Asn Arg Gly lie Leu Ser Pro Cys Gly Arg Cys Arg Gin Val 
85 90 95 

Leu Leu Asp Leu His Pro Gly lie Lys Ala lie Val Lys Asp Ser Asp 
100 105 110 

Gly Gin Pro Thr Ala Val Gly lie Arg Glu Leu Leu Pro Ser Gly Tyr 
115 120 125 



val Trp Glu Gly 
130 



<210> 16 

<211> 309 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 16 

Met Ala Ser Val val Gly Trp Gly Pro His Ser Leu His Ala Cys Pro 
1 5 10 15 

Ala Leu val Leu Ser Asn Asp Val Thr lie Asp Ala Trp Cys Pro Leu 
20 25 30 



Cys Gly Pro His Glu Arg Leu Gin Phe Glu Arg lie Asp Thr Thr Leu 
35 40 45 
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Thr Cys Glu Thr His Arg lie Asn Trp Thr Ala Asp Gly Arg Pro Cys 
50 55 60 

Gly Leu Asn Gly Thr Leu Phe Pro Arg Leu His val Ser Glu Thr Arg 

65 70 75 80 

Pro Gin Gly Pro Arg Arg Leu Trp lie Asn Cys Pro Leu Pro Ala Val 
85 90 95 

Arg Ala Gin Pro Gly Pro val Ser Leu Ser Pro Phe Glu Arg Ser Pro 
100 105 110 

Phe Gin Pro Tyr Gin Cys Gin Leu Pro Ser Ala Ser Ser Asp Gly Cys 
115 120 125 

Pro lie lie Gly His Gly Leu Leu Pro Trp Asn Asn Leu val Thr His 
130 135 140 

Pro val Leu Arg Lys val Leu lie Leu Asn Gin Met Ala Asn Phe Ser 

145 150 155 160 

Leu Leu Pro Ser Phe Asp Thr Leu Leu val Asp Pro Leu Arg Leu Ser 
165 170 175 

val Phe Ala Pro Asp Thr Arg Gly Ala lie Arg Tyr Leu Ser Thr Leu 
180 185 190 

Leu Thr Leu Cys Pro Ala Thr Cys lie Leu Pro Leu Gly Glu Pro Phe 
195 200 205 

Ser Pro Asn val Pro lie Cys Arg Phe Pro Arg Asp Ser Asn Glu Pro 
210 215 ~ 220 

Pro Leu Ser Glu Phe Glu Leu Pro Leu lie Gin Thr Pro Gly Leu Ser 

225 230 235 240 

Trp Ser val Pro Ala lie Asp Leu Phe Leu Thr Gly Pro Pro Ser Pro 
245 250 255 

Cys Asp Arg Leu His val Trp Ser Ser Pro Gin Ala Leu Gin Arg Phe 
260 265 270 

Leu His Asp Pro Thr Leu Thr Trp Ser Glu Leu val Ala Ser Arg Lys 
275 280 285 

lie Arg Leu Asp Ser Pro Leu Lys Leu Gin Leu Leu Glu Asn Glu Trp 
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290 295 300 



Leu Ser Arg Leu Phe 
305 



<210> 17 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 17 

Leu Pro Met Leu Asn Gin 
1 5 



<210> 18 

<211> 75 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 18 

Gly Thr Tyr Leu ser Asp Leu Ser lie Ser Phe lie His Ser Cys Leu 
15 10 15 

Thr Pro Arg Arg Val Asp Asn Tyr Asp Thr Gly Gly Leu Thr lie Trp 
20 25 30 

Pro Gin Cys Cys Asn Asp Thr Ala Arg Pro Thr Leu Thr Gly Ser Arg 
35 40 45 

Phe lie Ser Asn Lys Pro Ala Ser Arg Lys Gly Arg Ala Gin Lys Trp 
50 55 60 

Ser Cys Asn Phe lie Arg Leu His Pro val Tyr 
65 70 75 

<210> 19 

<211> 5 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 19 

Leu Leu Pro Gly Ser 
1 5 

Page 21 



Warf Sequence.ST25.txt 



<210> 20 

<211> 44 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 20 

Phe Ala Gin Arg Cys Cys His Cys Cys Arg His Arg Gly Val Thr Leu 
1 5 10 15 



val val Trp Tyr Gly Phe lie Gin Leu Arg Phe Pro Thr lie Lys Ala 
20 25 30 



Ser Tyr Met lie Pro His val val Gin Lys Ser Gly 
35 40 



<210> 21 

<211> 10 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 21 

Leu Leu Arg Ser Ser Asp Arg Cys Gin Lys 
1 5 10 



<210> 22 

<211> 14 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 22 

val Gly Arg Ser val lie Thr His Gly Tyr Gly Ser Thr Ala 
1 ' " 5 10 



<210> 23 

<211> 15 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 23 

Phe Ser Tyr Cys His Ala lie Arg Lys Met Leu Phe Cys Asp Trp 
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15 10 15 

<210> 24 

<211> 24 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 24 

val Leu Asn Gin val lie Leu Arg lie val Tyr Ala Ala Thr Glu Leu 
1 5 ^10 15 

Leu Leu Pro Gly val Asn Thr Gly 
20 



<210> 25 

<211> 4 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 25 

Tyr Arg Ala Thr 
1 



<210> 26 

<211> 73 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 26 

Gin Asn Phe Lys Ser Ala His His Trp Lys Thr Phe Phe Gly Ala Lys 
15 10 15 

Thr Leu Lys Asp Leu Thr Ala val Glu He Gin Phe Asp val Thr His 
20 25 30 

Ser Cys Thr Gin Leu lie Phe Ser lie Phe Tyr Phe His Gin Arg Phe 
35 40 45 

Trp val Ser Lys Asn Arg Lys Ala Lys Cys Arg Lys Lys Gly Asn Lys 
50 ' 55 60 

Gly Asp Thr Glu Met Leu Asn Thr His 
65 70 
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<210> 27 

<211> 7430 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered Sequence from virus and plasmid 



<220> 

<221> LTR 

<222> (1)..(589) 

<223> 5' MOMUSVLTR 

<220> 

<221> CDS 

<222> (1512). .(2306) 

<223> neomycin resistance 

<220> 

<221> mi sc_feature 

<222> (3023). .(3047) 

<223> attBl 

<220> 

<221> CDS 

<222> (3120). .(3590) 

<223> trans-dominant BLV Rex (M4) 

<220> 

<221> misc_feature 

<222> (3653). .(4282) 

<223> attB2 

<220> 

<221> misc_signal 

<222> (3690). .(4282) 

<223> WPRE; woodchuck hepatitis virus post-transcriptional regulatory 
element 

<220> 

<221> LTR 

<222> (4328). .(4921) 

<223> 3' MoMuLVLTR 

<220> 

<221> CDS 

<222> (6217). .(7077) 

<223> ampicillin resistance 

<400> 27 



tttgaaagac cccacccgta 


ggtggcaagc 


tagcttaagt 


aacgccactt 


tgcaaggcat 


60 


ggaaaaatac ataactgaga 


atagaaaagt 


tcagatcaag 


gtcaggaaca 


aagaaacagc 


120 


tgaataccaa acaggatatc 


tgtggtaagc 


ggttcctgcc 


ccggctcagg 


gccaagaaca 


180 


gatgagacag ctgagtgatg 


ggccaaacag 


gatatctgtg 


gtaagcagtt 


cctgccccgg 


240 


ctcggggcca agaacagatg 


gtccccagat 


gcggtccagc 


cctcagcagt 


ttctagtgaa 


300 
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tcatcagatg 


tttccagggt 


gccccaagga 


cctgaaaatg 


accctgtacc 


ttatttgaac 


360 


taaccaatca 


gttcgcttct 


cgcttctgtt 


cgcgcgcttc 


cgctctccga 


gctcaataaa 


420 


agagcccaca 


acccctcact 


cggcgcgcca 


gtcttccgat 


agactgcgtc 


gcccgggtac 


480 


ccgtattccc 


aataaagcct 


cttgctgttt 


gcatccgaat 


cgtggtctcg 


ctgttccttg 


540 


ggagggtctc 


ctctgagtga 


ttgactaccc 


acgacggggg 


tctttcattt 


gggggctcgt 


600 


ccgggatttg 


gagacccctg 


cccagggacc 


accgacccac 


caccgggagg 


taagctggcc 


660 


agcaacttat 


ctgtgtctgt 


ccgattgtct 


agtgtctatg 


tttgatgtta 


tgcgcctgcg 


720 


tctgtactag 


ttagctaact 


agctctgtat 


ctggcggacc 


cgtggtggaa 


ctgacgagtt 


780 


ctgaacaccc 


ggccgcaacc 


ctgggagacg 


tcccagggac 


tttgggggcc 


gtttttgtgg 


840 


cccgacctga 


ggaagggagt 


cgatgtggaa 


tccgaccccg 


tcaggatatg 


tggttctggt 


900 


aggagacgag 


aacctaaaac 


agttcccgcc 


tccgtctgaa 


tttttgcttt 


cggtttggaa 


960 


ccgaagccgc 


gcgtcttgtc 


tgctgcagcg 


ctgcagcatc 


gttctgtgtt 


gtctctgtct 


1020 


gactgtgttt 


ctgtatttgt 


ctgaaaatta 


gggccagact 


gttaccactc 


ccttaagttt 


1080 


gaccttaggt 


cactggaaag 


atgtcgagcg 


gatcgctcac 


aaccagtcgg 


tagatgtcaa 


1140 


gaagagacgt 


tgggttacct 


tctgctctgc 


agaatggcca 


acctttaacg 


tcggatggcc 


1200 


gcgagacggc 


acctttaacc 


gagacctcat 


cacccaggtt 


aagatcaagg 


tcttttcacc 


1260 


tggcccgcat 


ggacacccag 


accaggtccc 


ctacatcgtg 


acctgggaag 


ccttggcttt 


1320 


tgacccccct 


ccctgggtca 


agccctttgt 


acaccctaag 


cctccgcctc 


ctcttcctcc 


1380 


atccgccccg 


tctctccccc 


ttgaacctcc 


tcgttcgacc 


ccgcctcgat 


cctcccttta 


1440 


tccagccctc 


actccttctc 


taggcgccgg 


aattccgatc 


tgatcaagag 


acaggatgag 


1500 


gatcgtttcg 


c atg att gaa caa gat 
Met lie Glu Gin Asp 


gga ttg cac gca ggt tct ccg gcc 
Gly Leu His Ala Gly Ser Pro Ala 


1550 



1 5 10 

get tgg gtg gag agg eta ttc ggc tat gac tgg gca caa cag aca ate 1598 

Ala Trp val Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gin Gin Thr lie 

15 " 20 25 

ggc tgc tct gat gcc gcc gtg ttc egg ctg tea gcg cag ggg cgc ccg 1646 

Gly Cys Ser Asp Ala Ala val Phe Arg Leu Ser Ala Gin Gly Arg Pro 

30 35 40 45 

gtt ctt ttt gtc aag ace gac ctg tec ggt gcc ctg aat gaa ctg cag 1694 

val Leu Phe val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gin 

50 55 60 

gac gag gca gcg egg eta teg tgg ctg gcc acg acg ggc gtt cct tgc 1742 

Asp Glu Ala Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys 

65 70 75 

gca get gtg etc gac gtt gtc act gaa gcg gga agg gac tgg ctg eta 1790 

Ala Ala val Leu Asp val val Thr Glu Ala Gly Arg Asp Trp Leu Leu 

80 85 90 
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ttg gqc gaa gtq ccg gqg cag gat etc ctg tea tct cac ctt get cct 1838 

Leu Gly Glu vai Pro Gly Gin Asp Leu Leu Ser Ser His Leu Ala Pro 

95 100 105 

gec gag aaa gta tec ate atg get gat gca atg egg egg ctg cat acg 1886 

Ala Glu Lys val Ser lie Met Ala Asp Ala Met Arg Arg Leu His Thr 
110 115 120 ^ 125 

ctt gat ccg get ace tgc cca ttc gac cac caa gcg aaa cat cgc ate 1934 

Leu Asp Pro Ala Thr Cys Pro Phe Asp His Gin Ala Lys His Arg lie 

130 135 140 

gag cga gca cgt act egg atg gaa gee gqt ctt gtc gat cag gat gat 1982 

Glu Arg Ala Arg Thr Arg Met Glu Ala Gly Leu val Asp Gin Asp Asp 
145 150 155 

ctg gac gaa gag cat cag gqg etc gcg cca gee gaa ctg ttc gee agg 2030 

Leu Asp Glu Glu His Gin Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg 
160 165 170 

etc aag gcg cgc atg ccc gac gqc gag gat etc gtc gtq ace cat gqc 2078 

Leu Lys Ala Arg Met Pro Asp Gly Glu Asp Leu val val Thr His Gly 

175 180 185 

gat gee tgc ttg ccg aat ate atg gtq gaa aat ggc cgc ttt tct gga 2126 

Asp Ala Cys Leu Pro Asn lie Met Val Glu Asn Gly Arg Phe Ser Gly 
190 195 200 205 



ttc ate gac tgt gqc egg ctg gqt gtq gcg gac cgc tat cag gac ata 
Phe lie Asp Cys Gly Arg Leu Gly vat Ala Asp Arg Tyr Gin Asp lie 
210 " 215 220 



2174 



gcg ttg get ace cgt gat att get gaa gag ctt ggc gqc gaa tgg get 2222 

Ala Leu Ala Thr Arg Asp lie Ala Glu Glu Leu Gly Gly Glu Trp Ala 
225 230 235 

gac cgc ttc etc gtq ctt tac gqt ate gee get ccc gat teg cag cgc 2270 

Asp Arg Phe Leu vat Leu Tyr Gly lie Ala Ala Pro Asp Ser Gin Arg 
240 245 250 



ate gee ttc tat cgc ctt ctt gac gag ttc ttc 
lie Ala Phe Tyr Arg Leu Leu Asp Glu Phe Phe 
255 260 


tga gegggactet 


2316 


ggggttcgaa 


atgaccgacc 


aagcgacgcc 


caacctgcca 


tcacgagatt 


tcgattccac 


2376 


cgccgccttc 


tatgaaaggt 


tgggcttegg 


aatcgttttc 


cgggacgccg 


gctggatgat 


2436 


cctccagcgc 


ggggatctca 


tgctggagtt 


cttcgcccac 


cccgggctcg 


atcccctcgc 


2496 


gagttggttc 


agctgctgcc 


tgaggctgga 


cgacctcgcg 


gagttctacc 


ggcagtgcaa 


2556 


atccgtcggc 


atccaggaaa 


ccagcagcgg 


ctatccgcgc 


atccatgccc 


ccgaactgca 


2616 


ggagtgggga 


ggcacgatgg 


ccgctttggt 


egaggeggat 


cctagcagaa 


aaataagact 


2676 


tgattccccc 


ttaaaattac 


aactgetaga 


aaatgaatgg 


ctctcccgcc 


ttttttgagg 


2736 


gggaatcatt 


tgtatgaaag 


ateatgeega 


cctaggcgcc 


gccaccgccc 


cgtaaaccag 


2796 


acagagacgt 


cagctgccag 


aaaagctggt 


gaeggcaget 
Page 


ggtggctaga 
26 


atccccgtac 


2856 
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ctccccaact tcccctttcc cgaaaaatcc acaccctgag ctgctgacct cacctgctga 2916 

taaattaata aaatgccggc cctgtcgagt tagcggcacc agaagcgttc ttctcctgag 2976 

accctcgtgc tcagctctcg gtcctgcctc gagaagcttg ttatcaacaa gtttgtacaa 3036 

aaaagcaggc ttcgaaggag atagaaccaa ttctctaagg aaatacttaa cgtcgactgg 3096 

atccggtacc gaattcgatc cac atg cct aaa aaa cga egg tec cga aga cgc 3149 

Met Pro Lys Lys Arg Arg ser Arg Arg Arg 

265 270 

cca caa ccg ate ate aga tgg caa gtg ttg ttg gtt ggg ggc ccc act 3197 

pro Gin Pro lie lie Arg Trp Gin Vat Leu Leu val Gly Gly Pro Thr 

275 280 285 290 

etc tac atg cct gec egg ccc tgg ttt tgt cca atg atg tea cca teg 3245 

Leu Tyr Met Pro Ala Arg Pro Trp Phe Cys Pro Met Met Ser Pro Ser 

295 300 305 

atg cct gqt gec ccc tct gcg ggc ccc atg age gac tec aat teg aaa 3293 

Met Pro Gly Ala Pro Ser Ala Gly Pro Met Ser Asp Ser Asn Ser Lys 

310 315 320 

gga teg aca cca cgc tea cct gcg aga ccc ace gta tea act gga ccg 3341 

Gly Ser Thr Pro Arg Ser Pro Ala Arg Pro Thr val Ser Thr Gly Pro 

325 330 335 

ccg atg gac gac ctt gcg gec tea atg gaa cgt tgt tec etc gac tgc 3389 

pro Met Asp Asp Leu Ala Ala Ser Met Glu Arg Cys Ser Leu Asp cys 

340 345 350 



atg tct ccg aga ccc gec ccc aag ggc ccc gac gac tct gga tea act 
Met Ser Pro Arg Pro Ala Pro Lys Gly Pro Asp Asp Ser Gly Ser Thr 
355 360 365 370 



cct eta gcg acg gtt gec cca tta teg ggc acg gec ttc ttc cct gga 
Pro Leu Ala Thr val Ala Pro Leu Ser Gly Thr Ala Phe Phe Pro Gly 
405 410 415 



3437 



gee ccc ttc egg ccg ttc gcg etc age ccg gee egg tta gat ctt ccc 3485 

Ala Pro Phe Arg Pro Phe Ala Leu Ser Pro Ala Arg Leu Asp Leu Pro 
375 380 - 385 

cct teg age gqt ccc cct tec age cct ace aat gee aat tgc cct egg 3533 

Pro Ser Ser Gly Pro Pro Ser Ser Pro Thr Asn Ala Asn Cys Pro Arg 
390 395 400 



3581 



aca act tag taacgcatcc tgtcctcaga aaagtcctta tattaaatca 3630 
Thr Thr 
420 

aatgggacct cgagatatct agacccagct ttcttgtaca aagtggttga taacatcgat 3690 

aatcaacctc tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct 3750 

ccttttacgc tatgtggata egctgettta atgcctttgt ateatgetat tgcttcccgt 3810 

atggctttca ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg 3870 

tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt ttgetgaege aacccccact 3930 

ggttggggca ttgccaccac ctgtcagctc etttceggga etttegcttt ccccctccct 3990 
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attgccacgg 


cggaactcat 


cgccgcctgc 


cttgcccgct 


gctggacagg 


ggctcggctg 


4050 


ttgggcactg 


acaattccgt 


ggtgttgtcg 


gggaaatcat 


cgtcctttcc 


ttggctgctc 


4110 


gcctgtgttg 


ccacctggat 


tctgcgcggg 


acgtccttct 


gctacgtccc 


ttcggccctc 


4170 


aatccagcgg 


accttccttc 


ccgcggcctg 


ctgccggctc 


tgcggcctct 


tccgcgtctt 


4230 


cgccttcgcc 


ctcagacgag 


tcggatctcc 


ctttgggccg 


cctccccgcc 


tgatcgataa 


4290 


aataaaagat 


tttatttagt 


ctccagaaaa 


aggggggaat 


gaaagacccc 


acctgtaggt 


4350 


ttggcaagct 


agcttaagta 


acgccatttt 


gcaaggcatg 


gaaaaataca 


taactgagaa 


4410 


tagagaagtt 


cagatcaagg 


tcaggaacag 


atggaacagc 


tgaatatggg 


ccaaacagga 


4470 


tatctgtggt 


aagcagttcc 


tgccccggct 


cagggccaag 


aacagatgga 


acagctgaat 


4530 


atgggccaaa 


caggatatct 


gtggtaagca 


gttcctgccc 


cggctcaggg 


ccaagaacag 


4590 


atggtcccca 


gatgcggtcc 


agccctcagc 


agtttctaga 


gaaccatcag 


atgtttccag 


4650 


ggtgccccaa 


ggacctgaaa 


tgaccctgtg 


ccttatttga 


actaaccaat 


cagttcgctt 


4710 


ctcgcttctg 


ttcgcgcgct 


tctgctcccc 


gagctcaata 


aaagagccca 


caacccctca 


4770 


ctcggggcgc 


cagtcctccg 


attgactgag 


tcgcccgggt 


acccgtgtat 


ccaataaacc 


4830 


ctcttgcagt 


tgcatccgac 


ttgtggtctc 


gctgttcctt 


gggagggtct 


cctctgagtg 


4890 


attgactacc 


cgtcagcggg 


ggtctttcat 


ttgggggctc 


gtccgggatc 


gggagacccc 


4950 


tgcccaggga 


ccaccgaccc 


accaccggga 


ggtaagctgg 


ctgcctcgcg 


cgtttcggtg 


5010 


atgacggtga 


aaacctctga 


cacatgcagc 


tcccggagac 


ggtcacagct 


tgtctgtaag 


5070 


cggatgccgg 


gagcagacaa 


gcccgtcagg 


gcgcgtcagc 


gggtgttggc 


gggtgtcggg 


5130 


gcgcagccat 


gacccagtca 


cgtagcgata 


gcggagtgta 


tactggctta 


actatgcggc 


5190 


atcagagcag 


attgtactga 


gagtgcacca 


tatgcggtgt 


gaaataccgc 


acagatgcgt 


5250 


aaggagaaaa 


taccgcatca 


ggcgctcttc 


cgcttcctcg 


ctcactgact 


cgctgcgctc 


5310 


ggtcgttcgg 


ctgcggcgag 


cggtatcagc 


tcactcaaag 


gcggtaatac 


ggttatccac 


5370 


agaatcaggg 


gataacgcag 


gaaagaacat 


gtgagcaaaa 


ggccagcaaa 


aggccaggaa 


5430 


ccgtaaaaag 


gccgcgttgc 


tggcgttttt 


ccataggctc 


cgcccccctg 


acgagcatca 


5490 


caaaaatcga 


cgctcaagtc 


agaggtggcg 


aaacccgaca 


ggactataaa 


gataccaggc 


5550 


gtttccccct 


ggaagctccc 


tcgtgcgctc 


tcctgttccg 


accctgccgc 


ttaccggata 


5610 


cctgtccgcc 


tttctccctt 


cgggaagcgt 


ggcgctttct 


catagctcac 


gctgtaggta 


5670 


tctcagttcg 


gtgtaggtcg 


ttcgctccaa 


gctgggctgt 


gtgcacgaac 


cccccgttca 


5730 


gcccgaccgc 


tgcgccttat 


ccggtaacta 


tcgtcttgag 


tccaacccgg 


taagacacga 


5790 


cttatcgcca 


ctggcagcag 


ccactggtaa 


caggattagc 


agagcgaggt 


atgtaggcgg 


5850 
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tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 5910 

tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 5970 

caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 6030 

aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 6090 

cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 6150 

ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 6210 

tgacag tta cca atg ctt aat cag tga gqc acc tat etc age gat ctg 6258 

Leu Pro Met Leu Asn Gin Gly Thr Tyr Leu Ser Asp Leu 

425 430 

tct att teg ttc ate cat agt tgc ctg act ccc cgt cgt gta gat aac 6306 

Ser lie Ser Phe lie His Ser Cys Leu Thr Pro Arg Arg val Asp Asn 

435 440 445 

tac gat acg gga ggg ctt acc ate tgg ccc cag tgc tgc aat gat acc 6354 

Tyr Asp Thr Gly Gly Leu Thr lie Trp Pro Gin Cys Cys Asn Asp Thr 

450 * 455 460 465 

gcg aga ccc acg etc acc gqc tec aga ttt ate age aat aaa cca gee 6402 

Ala Arg Pro Thr Leu Thr Gly Ser Arg Phe lie Ser Asn Lys Pro Ala 

470 475 480 

age egg aag gqc cga gcg cag aag tgg tec tgc aac ttt ate cgc etc 6450 

Ser Arg Lys Gly Arg Ala Gin Lys Trp Ser Cys Asn Phe lie Arg Leu 

485 ~ 490 495 

cat cca gtc tat taa ttg ttg ccg gga age tag agt aag tag ttc gee 6498 

His Pro val Tyr Leu Leu Pro Gly Ser Ser Lys Phe Ala 

500 505 510 

agt taa tag ttt gcg caa cgt tgt tgc cat tgc tgc agg cat cgt gqt 6546 

Ser Phe Ala Gin Arg Cys Cys His Cys Cys Arg His Arg Gly 

515 520 

gtc acg etc gtc gtt tgg tat ggc ttc att cag etc egg ttc cca acg 6594 

Val Thr Leu val val Trp Tyr Gly Phe lie Gin Leu Arg Phe Pro Thr 

525 530 535 540 

ate aag gcg agt tac atg ate ccc cat gtt gtg caa aaa age gqt tag 6642 

lie Lys Ala Ser Tyr Met lie Pro His val val Gin Lys Ser Gly 

545 550 555 

etc ctt egg tec tec gat cgt tgt cag aag taa gtt gqc cgc agt gtt 6690 

Leu Leu Arg Ser Ser Asp Arg Cys Gin Lys val Gly Arg Ser val 

560 ~ 565 ~ 570 

ate act cat gqt tat gqc age act gca taa ttc tct tac tgt cat gec 6738 

lie Thr His Gly Tyr Gly Ser Thr Ala Phe Ser Tyr Cys His Ala 

575 ' 580 585 

ate cgt aag atg ctt ttc tgt gac tgg tga gta etc aac caa gtc att 6786 

lie Arg Lys Met Leu Phe Cys Asp Trp Val Leu Asn Gin val lie 

590 595 600 

ctg aga ata gtg tat gcg gcg acc gag ttg etc ttg ccc gqc gtc aac 6834 

Leu Arg lie val Tyr Ala Ala Thr Glu Leu Leu Leu Pro Gly val Asn 

605 610 615 
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acg gqa taa tac cgc gcc aca tag cag aac ttt aaa agt get cat cat 6882 

Thr Gly Tyr Arg Ala Thr Gin Asn Phe Lys Ser Ala His His 

620 625 630 

tgg aaa acg ttc ttc ggg gcg aaa act etc aag gat ctt acc get gtt 6930 

Trp Lys Thr Phe Phe Gly Ala Lys Thr Leu Lys Asp Leu Thr Ala Val 

635 640 645 

gag ate cag ttc gat gta acc cac teg tgc acc caa ctg ate ttc age 6978 

Glu lie Gin Phe Asp val Thr His Ser Cys Thr Gin Leu lie Phe Ser 

650 655 660 

ate ttt tac ttt cac cag cgt ttc tgg gtq age aaa aac agg aag gca 7026 

lie Phe Tyr Phe His Gin Arg Phe Trp Val Ser Lys Asn Arg Lys Ala 

665 670 675 

aaa tgc cgc aaa aaa gqg aat aag ggc gac acg gaa atg ttg aat act 7074 

Lys Cys Arg Lys Lys Gly Asn Lys Gly Asp Thr Glu Met Leu Asn Thr 

680 685 690 

cat actcttcctt tttcaatatt attgaagcat ttatcagggt tattgtctca 7127 

His 

695 



tgageggata 


catatttgaa 


tgtatttaga aaaataaaca 


aataggggtt 


ccgcgcacat 


7187 


ttccccgaaa 


agtgccacct 


gaegtctaag 


aaaccattat 


tatcatgaca 


ttaacctata 


7247 


aaaataggcg 


tatcacgagg 


ccctttcgtc 


ttcaagaatt 


aattcatacc 


agatcaccga 


7307 


aaactgtcct 


ccaaatgtgt 


ccccctcaca 


ctcccaaatt 


cgcgggcttc 


tgectcttag 


7367 


accactctac 


cctattcccc 


acactcaccg 


gagecaaage 


cgcggccctt 


ccgtttcttt 


7427 


get 












7430 



<210> 28 

<211> 264 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 28 

Met lie Glu Gin Asp Gly Leu His Ala Gly Ser Pro Ala Ala Trp val 
1 5 10 15 

Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gin Gin Thr lie Gly Cys Ser 
20 " 25 30 

Asp Ala Ala val Phe Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe 
35 40 45 

val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala 
50 55 60 
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Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys Ala Ala val 
65 70 75 80 

Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu 
85 90 95 

val Pro Gly Gin Asp Leu Leu Ser ser His Leu Ala Pro Ala Glu Lys 
100 105 110 

val Ser lie Met Ala Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro 
115 120 125 

Ala Thr Cys Pro Phe Asp His Gin Ala Lys His Arg lie Glu Arg Ala 

130 135 140 

Arg Thr Arg Met Glu Ala Gly Leu val Asp Gin Asp Asp Leu Asp Glu 
145 ~ 150 155 160 

Glu His Gin Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala 
165 170 ~ 175 

Arg Met Pro Asp Gly Glu Asp Leu val val Thr His Gly Asp Ala Cys 
180 185 190 

Leu Pro Asn lie Met val Glu Asn Gly Arg Phe Ser Gly Phe lie Asp 
195 200 205 

Cys Gly Arg Leu Gly val Ala Asp Arg Tyr Gin Asp lie Ala Leu Ala 

210 " 215 220 



Thr Arg Asp lie Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe 

225 230 235 240 

Leu val Leu Tyr Gly lie Ala Ala Pro Asp Ser Gin Arg lie Ala Phe 
245 250 255 

Tyr Arg Leu Leu Asp Glu Phe Phe 
260 



<210> 29 

<211> 156 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 29 
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Met Pro Lys Lys Arg Arg Ser Arg Arg Arg Pro Gin Pro lie lie Arg 
1 5 10 15 



Trp Gin Val Leu Leu val Gly Gly Pro Thr Leu Tyr Met Pro Ala Arg 
20 25 30 

pro Trp Phe Cys Pro Met Met Ser Pro Ser Met Pro Gly Ala Pro Ser 
35 40 45 

Ala Gly Pro Met Ser Asp Ser Asn Ser Lys Gly Ser Thr Pro Arg Ser 
50 55 60 

Pro Ala Arg Pro Thr val Ser Thr Gly Pro Pro Met Asp Asp Leu Ala 
65 70 75 80 

Ala Ser Met Glu Arg Cys Ser Leu Asp Cys Met Ser Pro Arg Pro Ala 
85 90 95 

Pro Lys Gly Pro Asp Asp Ser Gly Ser Thr Ala Pro Phe Arg Pro Phe 
100 105 110 

Ala Leu Ser Pro Ala Arg Leu Asp Leu Pro Pro Ser Ser Gly Pro Pro 
115 " 120 125 

Ser Ser Pro Thr Asn Ala Asn Cys Pro Arg Pro Leu Ala Thr val Ala 
130 135 140 

Pro Leu Ser Gly Thr Ala Phe Phe Pro Gly Thr Thr 
145 150 155 



<210> 30 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 30 

Leu Pro Met Leu Asn Gin 
1 5 



<210> 31 

<211> 75 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 31 
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Gly Thr Tyr Leu Ser Asp Leu Ser lie Ser Phe lie His Ser Cys Leu 
1 5 10 15 

Thr Pro Arg Arg Val Asp Asn Tyr Asp Thr Gly Gly Leu Thr lie Trp 
20 25 30 

Pro Gin Cys Cys Asn Asp Thr Ala Arg Pro Thr Leu Thr Gly Ser Arg 
35 40 " 45 

Phe lie Ser Asn Lys Pro Ala Ser Arg Lys Gly Arg Ala Gin Lys Trp 
50 '55 60 

Ser Cys Asn Phe lie Arg Leu His Pro Val Tyr 
65 70 75 



<210> 32 

<211> 5 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 32 

Leu Leu Pro Gly Ser 
1 5 



<210> 33 

<211> 44 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 33 

Phe Ala Gin Arg Cys Cys His Cys Cys Arg His Arg Gly val Thr Leu 
1 5 10 15 

val val Trp Tyr Gly Phe lie Gin Leu Arg Phe Pro Thr lie Lys Ala 
20 25 30 

Ser Tyr Met lie Pro His val val Gin Lys Ser Gly 
35 40 

<210> 34 

<211> 10 

<212> PRT 

<213> Artificial 

<220> 
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<223> Synthetic construct 
<400> 34 

Leu Leu Arg ser Ser Asp Arg Cys Gin Lys 
1 5 10 



<210> 35 

<211> 14 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 35 

val Gly Arg Ser val lie Thr His Gly Tyr Gly Ser Thr Ala 
15 10 



<210> 36 

<211> 15 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 36 

Phe Ser Tyr Cys His Ala lie Arg Lys Met Leu Phe Cys Asp Trp 
15 10 15 



<210> 37 

<211> 24 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 37 

val Leu Asn Gin val lie Leu Arg lie val Tyr Ala Ala Thr Glu Leu 
1 5 10 15 



Leu Leu Pro Gly val Asn Thr Gly 
20 



<210> 38 

<211> 4 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 38 
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Tyr Arg Ala Thr 
1 
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<210> 39 

<211> 73 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 39 

Gin Asn Phe Lys Ser Ala His His Trp Lys Thr Phe Phe Gly Ala Lys 
15 10 15 

Thr Leu Lys Asp Leu Thr Ala val Glu lie Gin Phe Asp val Thr His 
20 25 30 

Ser Cys Thr Gin Leu lie Phe Ser lie Phe Tyr Phe His Gin Arg Phe 
35 40 45 

Trp val Ser Lys Asn Arg Lys Ala Lys Cys Arg Lys Lys Gly Asn Lys 
50 55 ' 60 

Gly Asp Thr Glu Met Leu Asn Thr His 
65 70 



<210> 40 

<211> 7010 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered Sequence from virus and plasmid 



<220> 

<221> LTR 

<222> (149).. (737) 

<223> 5' MoMuSVLTR 

<220> 

<221> CDS 

<222> (1660). .(2454) 

<223> neomycin resistance 

<220> 

<221> promoter 

<222> (2806). .(3150) 

<223> BLV promoter 

<220> 

<221> misc_feature 

<222> (3170). .(3194) 

<223> attbl 
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<220> 

<221> CDS 

<222> (3236). .(3955) 

<223> EYFP; enhanced yellow florescent protein 
<220> 

<221> misc_feature 

<222> (3980) . . (4004) 

<223> attb2 

<220> 

<221> LTR 

<222> (4056) . . (4649) 

<223> 3' MoMuSVLTR 

<220> 

<221> CDS 

<222> (5945).. (6805) 

<223> ampicillin resistance 

<400> 40 



gaattaattc 


ataccagatc 


accgaaaact 


gtcctccaaa 


tgtgtccccc 


tcacactccc 


60 


aaattcgcgg 


gcttctgcct 


cttagaccac 


tctaccctat 


tccccacact 


caccggagcc 


120 


aaagccgcgg 


cccttccgtt 


tctttgcttt 


tgaaagaccc 


cacccgtagg 


tggcaagcta 


180 


gcttaagtaa 


cgccactttg 


caaqgcatqg 


aaaaatacat 


aactgagaat 


agaaaagttc 


240 


agatcaaggt 


caggaacaaa 


gaaacagctg 


aataccaaac 


aggatatctg 


tggtaagcgg 


300 


ttcctgcccc 


ggctcagggc 


caagaacaga 


tgagacagct 


gagtgatggg 


ccaaacagga 


360 


tatctgtggt 


aagcagttcc 


tgccccggct 


cggggccaag 


aacagatggt 


ccccagatgc 


420 


ggtccagccc 


tcagcagttt 


ctagtgaatc 


atcagatgtt 


tccagggtgc 


cccaaggacc 


480 


tgaaaatgac 


cctgtacctt 


atttgaacta 


accaatcagt 


tcgcttctcg 


cttctgttcg 


540 


cgcgcttccg 


ctctccgagc 


tcaataaaag 


agcccacaac 


ccctcactcg 


gcgcgccagt 


600 


cttccgatag 


actgcgtcgc 


ccgggtaccc 


gtattcccaa 


taaagcctct 


tgctgtttgc 


660 


atccgaatcg 


tggtctcgct 


gttccttggg 


agggtctcct 


ctgagtgatt 


gactacccac 


720 


gacgggggtc 


tttcatttgg 


gggctcgtcc 


gggatttgga 


gacccctgcc 


cagggaccac 


780 


cgacccacca 


ccgggaggta 


agctggccag 


caacttatct 


gtgtctgtcc 


gattgtctag 


840 


tgtctatgtt 


tgatgttatg 


cgcctgcgtc 


tgtactagtt 


agctaactag 


ctctgtatct 


900 


ggcggacccg 


tggtggaact 


gacgagttct 


gaacacccgg 


ccgcaaccct 


gggagacgtc 


960 


ccagggactt 


tgggggccgt 


ttttgtggcc 


cgacctgagg 


aagggagtcg 


atgtggaatc 


1020 


cgaccccgtc 


aggatatgtg 


gttctggtag 


gagacgagaa 


cctaaaacag 


ttcccgcctc 


1080 


cgtctgaatt 


tttgctttcg 


gtttggaacc 


gaagccgcgc 


gtcttgtctg 


ctgcagcgct 


1140 


gcagcatcgt 


tctgtgttgt 


ctctgtctga 


ctgtgtttct 


gtatttgtct 


gaaaattagg 


1200 


gccagactgt 


taccactccc 


ttaagtttga 


ccttaggtca 
Page 


ctggaaagat 
36 


gtcgagcgga 


1260 
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tcgctcacaa ccagtcggta gatgtcaaga agagacgttg ggttaccttc tgctctgcag 1320 

aatggccaac ctttaacgtc ggatggccgc gagacggcac ctttaaccga gacctcatca 1380 

cccaggttaa gatcaaggtc ttttcacctg gcccgcatgg acacccagac caggtcccct 1440 

acatcgtgac ctgggaagcc ttggcttttg acccccctcc ctgggtcaag ccctttgtac 1500 

accctaagcc tccgcctcct cttcctccat ccgccccgtc tctccccctt gaacctcctc 1560 

gttcgacccc gcctcgatcc tccctttatc cagccctcac tccttctcta ggcgccggaa 1620 

ttccgatctg atcaagagac aggatgagga tcgtttcgc atg att gaa caa gat 1674 

Met lie Glu Gin Asp 
1 5 

gga ttg cac gca ggt tct ccg gcc get tgg gtg gag agg eta ttc ggc 1722 
Gly Leu His Ala Gly Ser Pro Ala Ala Trp val Glu Arg Leu Phe Gly 
10 15 20 

tat gac tgg gca caa cag aca ate gqc tgc tct gat gcc gcc gtg ttc 1770 
Tyr Asp Trp Ala Gin Gin Thr lie Gly Cys Ser Asp Ala Ala val Phe 
25 30 35 

egg ctg tea gcg cag gqg cgc ccg gtt ctt ttt gtc aag acc gac ctg 1818 
Arg Leu Ser Ala Gin Gly Arg Pro val Leu Phe Val Lys Thr Asp Leu 
40 45 50 

tec ggt gcc ctg aat gaa ctg cag gac gag gca gcg egg eta teg tgg 1866 
Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala Ala Arg Leu Ser Trp 
55 60 65 

ctg gcc acg acg gqc gtt cct tgc gca get gtg etc gac gtt gtc act 
Leu Ala Thr Thr Gly val Pro Cys Ala Ala val Leu Asp val val Thr 
70 75 80 85 



1914 



gaa gcg gga agg gac tgg ctg eta ttg gqc gaa gtg ccg gqg cag gat 1962 

Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu val Pro Gly Gin Asp 

90 95 100 

etc ctg tea tct cac ctt get cct gcc gag aaa gta tec ate atg get 2010 

Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys Val Ser lie Met Ala 

105 110 * 115 

gat gca atg egg egg ctg cat acg ctt gat ccg get acc tgc cca ttc 2058 

Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro Ala Thr Cys Pro Phe 

120 ~ 125 130 

gac cac caa gcg aaa cat cgc ate gag cga gca cgt act egg atg gaa 2106 

Asp His Gin Ala Lys His Arg lie Glu Arg Ala Arg Thr Arg Met Glu 

135 140 145 

gcc ggt ctt gtc gat cag gat gat ctg gac gaa gag cat cag gqg etc 2154 

Ala Gly Leu val Asp Gin Asp Asp Leu Asp Glu Glu His Gin Gly Leu 

150 155 160 165 

gcg cca gcc gaa ctg ttc gcc agg etc aag gcg cgc atg ccc gac gqc 2202 

Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala Arg Met Pro Asp Gly 

170 175 ~ 180 

gag gat etc gtc gtg acc cat gqc gat gcc tgc ttg ccg aat ate atg 2250 

Glu Asp Leu val val Thr His Gly Asp Ala Cys Leu Pro Asn lie Met 
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185 190 195 

gtg gaa aat ggc cgc ttt tct gga ttc ate gac tgt ggc egg ctg gqt 

val Glu Asn Gly Arg Phe Ser Gly Phe lie Asp Cys Gly Arg Leu Gly 
200 ~ 205 210 



2298 



gtg gcg gac cgc tat cag gac ata gcg ttg get acc cgt gat att get 2346 

Vat Ala Asp Arg Tyr Gin Asp lie Ala Leu Ala Thr Arg Asp lie Ala 
215 220 225 

gaa gag ctt ggc ggc gaa tgg get gac cgc ttc etc gtg ctt tac gqt 2394 

Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe Leu val Leu Tyr Gly 

230 235 240 245 

ate gee get ccc gat teg cag cgc ate gee ttc tat cgc ctt ctt gac 2442 

lie Ala Ala Pro Asp Ser Gin Arg lie Ala Phe Tyr Arg Leu Leu Asp 
250 ~ 255 260 

gag ttc ttc tga gegggactet ggggttcgaa atgaccgacc aagcgacgcc 2494 
Glu Phe Phe 



caacctgcca 


tcacgagatt 


tcgattccac 


cgccgccttc 


tatgaaaggt 


tgggcttegg 


2554 


aatcgttttc 


cgggacgccg 


gctggatgat 


cctccagcgc 


ggggatctca 


tgctggagtt 


2614 


cttcgcccac 


cccgggctcg 


atcccctcgc 


gagttggttc 


agctgctgcc 


tgaggctgga 


2674 


cgacctcgcg 


gagttctacc 


ggcagtgcaa 


atccgtcggc 


atccaggaaa 


ccagcagcgg 


2734 


ctatccgcgc 


atccatgccc 


ccgaactgca 


ggagtgggga 


ggcacgatgg 


ccgctttggt 


2794 


egaggeggat 


cctagcagaa 


aaataagact 


tgattccccc 


ttaaaattac 


aactgetaga 


2854 


aaatgaatgg 


ctctcccgcc 


ttttttgagg 


gggaatcatt 


tgtatgaaag 


ateatgeega 


2914 


cctaggcgcc 


gccaccgccc 


cgtaaaccag 


acagagacgt 


cagctgccag 


aaaagctggt 


2974 


gaeggcaget 


ggtggctaga 


atccccgtac 


ctccccaact 


tcccctttcc 


cgaaaaatcc 


3034 


acaccctgag 


ctgctgacct 


cacctgctga 


taaattaata 


aaatgeegge 


cctgtcgagt 


3094 


tagcggcacc 


agaagegtte 


ttctcctgag 


accctcgtgc 


tcagctctcg 


gtcctgcctc 


3154 


gagaagcttg 


ttatcacaag 


tttgtacaaa 


aaagcaggct 


tcgaaggaga 


tagaaccaat 


3214 


tctctaagga 


aatacttaac 


c atg gtg age aag ggc 
Met val ser Lys Gly 


gag gag ctg ttc acc 
Glu Glu Leu Phe Thr 


3265 



265 270 

999 gtg gtg ccc ate ctg gtc gag ctg gac ggc gac gta aac ggc cac 3313 

Gly val Val Pro lie Leu val Glu Leu Asp Gly Asp Val Asn Gly His 

275 280 285 290 

aag ttc age gtg tec ggc gag ggc gag ggc gat gec acc tac ggc aag 3361 

Lys Phe Ser val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 

295 300 305 

ctg acc ctg aag ttc ate tgc acc acc ggc aag ctg ccc gtg ccc tgg 3409 

Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 

310 315 320 

ccc acc etc gtg acc acc ttc ggc tac ggc ctg cag tgc ttc gec cgc 3457 
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Pro Thr Leu val Thr Thr Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg 
325 330 335 

tac ccc gac cac atg aag cag cac gac ttc ttc aag tec gec atg ccc 3505 
Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 
340 345 350 



gaa ggc tac gtc cag gag cgc acc ate ttc ttc aag gac gac ggc aac 
Glu Gly Tyr val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn 
355 360 ~ 365 370 



3553 



tac aag acc cgc gec gag gtg aag ttc gag ggc gac acc ctg gtg aac 3601 

Tyr Lys Thr Arg Ala Glu val Lys Phe Glu Gly Asp Thr Leu val Asn 

375 380 385 

cgc ate gag ctg aag ggc ate gac ttc aag gag gac ggc aac ate ctg 3649 

Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 

390 395 400 

gqg cac aag ctg gag tac aac tac aac age cac aac gtc tat ate atg 3697 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn val Tyr lie Met 

405 410 415 

gee gac aag cag aag aac ggc ate aag gtg aac ttc aag ate cgc cac 3745 

Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His 

420 425 * 430 

aac ate gag gac ggc age gtg cag etc gee gac cac tac cag cag aac 3793 

Asn lie Glu Asp Gly Ser vat Gin Leu Ala Asp His Tyr Gin Gin Asn 

435 440 445 450 

acc ccc ate ggc gac ggc ccc gtg ctg ctg ccc gac aac cac tac ctg 3841 

Thr Pro lie Gly Asp Gly Pro val Leu Leu Pro Asp Asn His Tyr Leu 

455 460 465 

age tac cag tec gee ctg age aaa gac ccc aac gag aag cgc gat cac 3889 

Ser Tyr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 

470 475 480 

atg gtc ctg ctg gag ttc gtg acc gec gec ggg ate act etc ggc atg 3937 

Met val Leu Leu Glu Phe val Thr Ala Ala Gly lie Thr Leu Gly Met 

485 490 495 



gac gag ctg tac aag taa agcggccgca ctcgagatat ctagacccag 
Asp Glu Leu Tyr Lys 
500 


3985 


ctttcttgta 


caaagtggtg 


ataacatcga 


taaaataaaa 


gattttattt 


agtctccaga 


4045 


aaaagggggg 


aatgaaagac 


cccacctgta 


ggtttggcaa 


gctagcttaa 


gtaacgecat 


4105 


tttgeaagge 


atggaaaaat 


acataactga 


gaatagagaa 


gttcagatca 


aggtcaggaa 


4165 


cagatggaac 


agctgaatat 


gggecaaaca 


ggatatctgt 


ggtaagcagt 


tcctgccccg 


4225 


gctcagggcc 


aagaacagat 


ggaacagctg 


aatatgggcc 


aaacaggata 


tctgtggtaa 


4285 


gcagttcctg 


ccccggctca 


gggccaagaa 


cagatggtcc 


ecagatgegg 


tccagccctc 


4345 


agcagtttct 


agagaaccat 


cagatgtttc 


cagggtgccc 


caaggacctg 


aaatgaccct 


4405 


gtgecttatt 


tgaactaacc 


aatcagttcg 


cttctcgctt 
Page 


ctgttcgcgc 
39 


gcttctgctc 


4465 
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cccgagctca 


ataaaagagc 


ccacaacccc 


tcactcgggg 


cgccagtcct 


ccgattgact 


4525 


gagtcgcccg 


ggtacccgtg 


tatccaataa 


accctcttgc 


agttgcatcc 


gacttgtggt 


4585 


ctcgctgttc 


cttgggaggg 


tctcctctga 


gtgattgact 


acccgtcagc 


gggggtcttt 


4645 


catttggggg 


ctcgtccggg 


atcgggagac 


ccctgcccag 


ggaccaccga 


cccaccaccg 


4705 


ggaggtaagc 


tggctgcctc 


gcgcgtttcg 


gtgatgacgg 


tgaaaacctc 


tgacacatgc 


4765 


agctcccgga 


gacggtcaca 


gcttgtctgt 


aagcggatgc 


cgggagcaga 


caagcccgtc 


4825 


agggcgcgtc 


agcgggtgtt 


ggcgggtgtc 


ggggcgcagc 


catgacccag 


tcacgtagcg 


4885 


atagcggagt 


gtatactggc 


ttaactatgc 


ggcatcagag 


cagattgtac 


tgagagtgca 


4945 


ccatatgcgg 


tgtgaaatac 


cgcacagatg 


cgtaaggaga 


aaataccgca 


tcaggcgctc 


5005 


ttccgcttcc 


tcgctcactg 


actcgctgcg 


ctcggtcgtt 


cggctgcggc 


gagcggtatc 


5065 


agctcactca 


aaggcggtaa 


tacggttatc 


cacagaatca 


ggggataacg 


caggaaagaa 


5125 


catgtgagca 


aaaggccagc 


aaaaggccag 


gaaccgtaaa 


aaggccgcgt 


tgctggcgtt 


5185 


tttccatagg 


ctccgccccc 


ctgacgagca 


tcacaaaaat 


cgacgctcaa 


gtcagaggtg 


5245 


gcgaaacccg 


acaggactat 


aaagatacca 


ggcgtttccc 


cctggaagct 


ccctcgtgcg 


5305 


ctctcctgtt 


ccgaccctgc 


cgcttaccgg 


atacctgtcc 


gcctttctcc 


cttcgggaag 


5365 


cgtggcgctt 


tctcatagct 


cacgctgtag 


gtatctcagt 


tcggtgtagg 


tcgttcgctc 


5425 


caagctgggc 


tgtgtgcacg 


aaccccccgt 


tcagcccgac 


cgctgcgcct 


tatccggtaa 


5485 


ctatcgtctt 


gagtccaacc 


cggtaagaca 


cgacttatcg 


ccactggcag 


cagccactgg 


5545 


taacaggatt 


agcagagcga 


ggtatgtagg 


cggtgctaca 


gagttcttga 


agtggtggcc 


5605 


taactacggc 


tacactagaa 


ggacagtatt 


tggtatctgc 


gctctgctga 


agccagttac 


5665 


cttcggaaaa 


agagttggta 


gctcttgatc 


cggcaaacaa 


accaccgctg 


gtagcggtgg 


5725 


tttttttgtt 


tgcaagcagc 


agattacgcg 


cagaaaaaaa 


ggatctcaag 


aagatccttt 


5785 


gatcttttct 


acggggtctg 


acgctcagtg 


gaacgaaaac 


tcacgttaag 


ggattttggt 


5845 


catgagatta 


tcaaaaagga 


tcttcaccta 


gatcctttta 


aattaaaaat 


gaagttttaa 


5905 


atcaatctaa 


agtatatatg 


agtaaacttg 


gtctgacag tta cca atg 
Leu Pro Met 


ctt aat 
Leu Asn 


5959 



505 

cag tga gqc acc tat etc age gat ctg tct att teg ttc ate cat agt 6007 
Gin Gly Thr Tyr Leu Ser Asp Leu Ser lie Ser Phe lie His Ser 
510 515 520 

tgc ctg act ccc cgt cgt gta gat aac tac gat acg gga gqg ctt acc 6055 
Cys Leu Thr Pro Arg Arg val Asp Asn Tyr Asp Thr Gly Gly Leu Thr 
525 ~ 530 535 

ate tgg ccc cag tgc tgc aat gat acc gcg aga ccc acg etc acc gqc 6103 
lie Trp Pro Gin Cys Cys Asn Asp Thr Ala Arg Pro Thr Leu Thr Gly 
540 545 550 555 

Page 40 



warf sequence.ST25.txt 

tec aga ttt ate age aat aaa cca gec age egg aag ggc cga gcg cag 6151 

Ser Arg Phe lie Ser Asn Lys Pro Ala Ser Arg Lys Gly Arg Ala Gin 

560 ' 565 570 

aag tgg tec tgc aac ttt ate cgc etc cat cca gtc tat taa ttg ttg 6199 

Lys Trp Ser Cys Asn Phe lie Arg Leu His Pro val Tyr Leu Leu 

575 580 585 

ccg gga age tag agt aag tag ttc gee agt taa tag ttt gcg caa cgt 6247 

Pro Gly Ser Ser Lys Phe Ala Ser Phe Ala Gin Arg 
590 595 

tgt tgc cat tgc tgc agg cat cgt ggt gtc acg etc gtc gtt tgg tat 6295 

Cys Cys His Cys Cys Arg His Arg Gly Val Thr Leu val val Trp Tyr 

600 605 ~ 610 

ggc ttc att cag etc egg ttc cca acg ate aag gcg agt tac atg ate 6343 

Gly Phe lie Gin Leu Arg Phe Pro Thr lie Lys Ala Ser Tyr Met lie 

615 620 625 630 

ccc cat gtt gtg caa aaa age ggt tag etc ctt egg tec tec gat cgt 6391 

Pro His val val Gin Lys Ser Gly Leu Leu Arg Ser Ser Asp Arg 

635 640 " 645 



tgt cag aag taa gtt ggc cgc agt gtt ate act cat ggt tat ggc age 
Cys Gin Lys val Gly Arg Ser Val lie Thr His Gly Tyr Gly Ser 

650 655 660 



ttc tgg gtg age aaa aac agg aag gca aaa tgc cgc aaa aaa ggg aat 
Phe Trp val Ser Lys Asn Arg Lys Ala Lys Cys Arg Lys Lys Gly Asn 
755 760 765 



6439 



act gca taa ttc tct tac tgt cat gee ate cgt aag atg ctt ttc tgt 6487 

Thr Ala Phe Ser Tyr Cys His Ala lie Arg Lys Met Leu Phe Cys 

665 670 675 

gac tgg tga gta etc aac caa gtc att ctg aga ata gtg tat gcg gcg 6535 

Asp Trp Val Leu Asn Gin val lie Leu Arg lie Val Tyr Ala Ala 

680 685 690 

ace gag ttg etc ttg ccc ggc gtc aac acg gga taa tac cgc gee aca 6583 

Thr Glu Leu Leu Leu Pro Gly val Asn Thr Gly Tyr Arg Ala Thr 

695 700 ~ 705 

tag cag aac ttt aaa agt get cat cat tgg aaa acg ttc ttc ggg gcg 6631 

Gin Asn Phe Lys Ser Ala His His Trp Lys Thr Phe Phe Gly Ala 
710 715 720 

aaa act etc aag gat ctt ace get gtt gag ate cag ttc gat gta ace 6679 

Lys Thr Leu Lys Asp Leu Thr Ala val Glu lie Gin Phe Asp val Thr 

725 730 735 

cac teg tgc ace caa ctg ate ttc age ate ttt tac ttt cac cag cgt 6727 

His Ser Cys Thr Gin Leu lie Phe Ser lie Phe Tyr Phe His Gin Arg 

740 745 750 



6775 



aag ggc gac acg gaa atg ttg aat act cat actcttcctt tttcaatatt 6825 
Lys Gly Asp Thr Glu Met Leu Asn Thr His 
770 775 

attgaagcat ttatcagggt tattgtctca tgageggata catatttgaa tgtatttaga 6885 
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aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgtctaag 



aaaccattat tatcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc 



ttcaa 



6945 
7005 
7010 



<210> 41 

<211> 264 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 41 

Met lie Glu Gin Asp Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val 
1 5 10 15 

Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gin Gin Thr lie Gly Cys Ser 
20 25 30 

Asp Ala Ala Val Phe Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe 
35 40 45 

Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala 
50 55 60 

Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly val Pro Cys Ala Ala val 
65 70 75 80 

Leu Asp val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu 
85 90 95 

Val Pro Gly Gin Asp Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys 
100 105 110 

val Ser lie Met Ala Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro 
115 120 ~ 125 

Ala Thr Cys Pro Phe Asp His Gin Ala Lys His Arg lie Glu Arg Ala 
130 135 140 

Arg Thr Arg Met Glu Ala Gly Leu val Asp Gin Asp Asp Leu Asp Glu 
145 150 155 160 

Glu His Gin Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala 
165 170 ~ 175 

Arg Met Pro Asp Gly Glu Asp Leu Val val Thr His Gly Asp Ala Cys 
180 185 190 
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Leu Pro Asn lie Met val Glu Asn Gly Arg Phe Ser Gly Phe lie Asp 
195 200 205 

Cys Gly Arg Leu Gly val Ala Asp Arg Tyr Gin Asp lie Ala Leu Ala 
210 215 " 220 

Thr Arg Asp lie Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe 
225 ^ 230 235 240 

Leu val Leu Tyr Gly lie Ala Ala Pro Asp Ser Gin Arg lie Ala Phe 
245 250 255 

Tyr Arg Leu Leu Asp Glu Phe Phe 
260 



<210> 42 

<211> 239 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic construct 
<400> 42 

Met val Ser Lys Gly Glu Glu Leu Phe Thr Gly val val Pro lie Leu 
15 10 15 

Val Glu Leu Asp Gly Asp val Asn Gly His Lys Phe Ser val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro val Pro Trp Pro Thr Leu val Thr Thr 

50 ' ' 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

val Lys Phe Glu Gly Asp Thr Leu val Asn Arg lie Glu Leu Lys Gly 
115 120 " 125 
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lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 

145 150 ' 155 160 

Gly lie Lys val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met val Leu Leu Glu Phe 

210 215 ~ 220 

val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 

225 230 235 



<210> 43 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 43 

Leu Pro Met Leu Asn Gin 
1 5 



<210> 44 

<211> 75 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 44 

Gly Thr Tyr Leu Ser Asp Leu Ser lie Ser Phe He His Ser Cys Leu 
15 10 15 

Thr Pro Arg Arg val Asp Asn Tyr Asp Thr Gly Gly Leu Thr lie Trp 
20 25 30 



Pro Gin Cys Cys Asn Asp Thr Ala Arg pro Thr Leu Thr Gly Ser Arg 
35 ' 40 45 
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Phe lie Ser Asn Lys Pro Ala Ser Arg Lys Gly Arg Ala Gin Lys Trp 

50 55 '60 

Ser Cys Asn Phe lie Arg Leu His Pro Val Tyr 
65 70 75 



<210> 45 

<211> 5 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 45 

Leu Leu Pro Gly Ser 
1 5 



<210> 46 

<211> 44 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 46 

Phe Ala Gin Arg Cys Cys His Cys Cys Arg His Arg Gly val Thr Leu 
1 5 10 15 

val Val Trp Tyr Gly Phe lie Gin Leu Arg Phe Pro Thr lie Lys Ala 
20 25 30 

Ser Tyr Met lie Pro His Val Val Gin Lys Ser Gly 
35 40 

<210> 47 

<211> 10 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 47 

Leu Leu Arg Ser Ser Asp Arg Cys Gin Lys 
1 5 " 10 



<210> 48 
<211> 14 
<212> PRT 
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<213> Artificial 
<220> 

<223> Synthetic Construct 
<400> 48 

val Gly Arg Ser val lie Thr His Gly Tyr Gly Ser Thr Ala 
15 10 



<210> 49 

<211> 15 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 49 

Phe Ser Tyr Cys His Ala lie Arg Lys Met Leu Phe Cys Asp Trp 
15 10 15 



<210> 50 

<211> 24 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 50 

val Leu Asn Gin val lie Leu Arg lie val Tyr Ala Ala Thr Glu Leu 
1 5 10 15 



Leu Leu Pro Gly val Asn Thr Gly 
20 



<210> 51 

<211> 4 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 51 

Tyr Arg Ala Thr 



<210> 52 

<211> 73 

<212> PRT 

<213> Artificial 



<220> 



Page 46 



warf Sequence.ST25.txt 

<223> Synthetic Construct 
<400> 52 

Gin Asn Phe Lys Ser Ala His His Trp Lys Thr Phe Phe Gly Ala Lys 
15 10 15 

Thr Leu Lys Asp Leu Thr Ala Val Glu lie Gin Phe Asp val Thr His 
20 25 30 

Ser Cys Thr Gin Leu lie Phe Ser lie Phe Tyr Phe His Gin Arg Phe 
35 40 45 

Trp val Ser Lys Asn Arg Lys Ala Lys Cys Arg Lys Lys Gly Asn Lys 
50 55 60 

Gly Asp Thr Glu Met Leu Asn Thr His 
65 70 

<210> 53 

<211> 7121 

<212> DNA 

<213> Artificial 

<220> 

<223> Engineered Sequence from virus and plasmid 
<220> 

<221> LTR 

<222> (149).. (737) 

<223> 5' MoMuSVLTR 

<220> 

<221> CDS 

<222> (1660) . . (2454) 

<223> neomycin resistance 

<220> 

<221> promoter 

<222> (2806). .(3261) 

<223> HIV promoter 

<220> 

<221> misc_feature 

<222> (3281). .(3305) 

<223> attBl 

<220> 

<221> CDS 

<222> (3347). .(4066) 

<223> EYFP ; enhanced yellow florescent protein 
<220> 

<221> misc_feature 

<222> (4091) . . (4115) 

<223> attB2 
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<220> 

<221> LTR 

<222> (4167) . . (4760) 

<223> 3' MoMuLVLTR 

<220> 

<221> CDS 

<222> (6056) . . (6916) 

<223> ampicillin resistance 

<400> 53 



gaattaattc 


ataccagatc 


accgaaaact 


gtcctccaaa 


tgtgtccccc 


tcacactccc 


60 


aaattcgcgg 


gcttctgcct 


cttagaccac 


tctaccctat 


tccccacact 


caccggagcc 


120 


aaagccgcgg 


cccttccgtt 


tctttgcttt 


tgaaagaccc 


cacccgtagg 


tggcaagcta 


180 


gcttaagtaa 


cgccactttg 


caaggcatgg 


aaaaatacat 


aactgagaat 


agaaaagttc 


240 


agatcaaggt 


caggaacaaa 


gaaacagctg 


aataccaaac 


aggatatctg 


tggtaagcgg 


300 


ttcctgcccc 


ggctcagggc 


caagaacaga 


tgagacagct 


gagtgatggg 


ccaaacagga 


360 


tatctgtggt 


aagcagttcc 


tgccccggct 


cggggccaag 


aacagatggt 


ccccagatgc 


420 


ggtccagccc 


tcagcagttt 


ctagtgaatc 


atcagatgtt 


tccagggtgc 


cccaaggacc 


480 


tgaaaatgac 


cctgtacctt 


atttgaacta 


accaatcagt 


tcgcttctcg 


cttctgttcg 


540 


cgcgcttccg 


ctctccgagc 


tcaataaaag 


agcccacaac 


ccctcactcg 


gcgcgccagt 


600 


cttccgatag 


actgcgtcgc 


ccgggtaccc 


gtattcccaa 


taaagcctct 


tgctgtttgc 


660 


atccgaatcg 


tggtctcgct 


gttccttggg 


agggtctcct 


ctgagtgatt 


gactacccac 


720 


gacgggggtc 


tttcatttgg 


gggctcgtcc 


gggatttgga 


gacccctgcc 


cagggaccac 


780 


cgacccacca 


ccgggaggta 


agctggccag 


caacttatct 


gtgtctgtcc 


gattgtctag 


840 


tgtctatgtt 


tgatgttatg 


cgcctgcgtc 


tgtactagtt 


agctaactag 


ctctgtatct 


900 


ggcggacccg 


tggtggaact 


gacgagttct 


gaacacccgg 


ccgcaaccct 


gggagacgtc 


960 


ccagggactt 


tgggggccgt 


ttttgtggcc 


cgacctgagg 


aagggagtcg 


atgtggaatc 


1020 


cgaccccgtc 


aggatatgtg 


gttctggtag 


gagacgagaa 


cctaaaacag 


ttcccgcctc 


1080 


cgtctgaatt 


tttgctttcg 


gtttggaacc 


gaagccgcgc 


gtcttgtctg 


ctgcagcgct 


1140 


gcagcatcgt 


tctgtgttgt 


ctctgtctga 


ctgtgtttct 


gtatttgtct 


gaaaattagg 


1200 


gccagactgt 


taccactccc 


ttaagtttga 


ccttaggtca 


ctggaaagat 


gtcgagcgga 


1260 


tcgctcacaa 


ccagtcggta 


gatgtcaaga 


agagacgttg 


ggttaccttc 


tgctctgcag 


1320 


aatggccaac 


ctttaacgtc 


ggatggccgc 


gagacggcac 


ctttaaccga 


gacctcatca 


1380 


cccaggttaa 


gatcaaggtc 


ttttcacctg 


gcccgcatgg 


acacccagac 


caggtcccct 


1440 


acatcgtgac 


ctgggaagcc 


ttggcttttg 


acccccctcc 


ctgggtcaag 


ccctttgtac 


1500 


accctaagcc 


tccgcctcct 


cttcctccat 


ccgccccgtc 


tctccccctt 


gaacctcctc 


1560 
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gttcgacccc gcctcgatcc tccctttatc cagccctcac tccttctcta ggcgccggaa 1620 

ttccgatctg atcaagagac aggatgagga tcgtttcgc atg att gaa caa gat 1674 

Met lie Glu Gin Asp 
1 5 

gqa ttg cac gca gqt tct ccg gcc get tgg gtq gag agg eta ttc gqc 1722 

Gly Leu His Ala Gly Ser Pro Ala Ala Trp val Glu Arg Leu Phe Gly 

10 15 20 

tat gac tgg gca caa cag aca ate gqc tgc tct gat gcc gcc gtq ttc 1770 

Tyr Asp Trp Ala Gin Gin Thr lie Gly Cys Ser Asp Ala Ala Val Phe 

25 30 35 

egg ctg tea gcg cag ggg cgc ccg gtt ctt ttt gtc aag ace gac ctg 1818 

Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe val Lys Thr Asp Leu 

40 45 50 

tec gqt gcc ctg aat gaa ctg cag gac gag gca gcg egg eta teg tgg 1866 

Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala Ala Arg Leu Ser Trp 

55 60 65 

ctg gcc acg acg gqc gtt cct tgc gca get gtq etc gac gtt gtc act 1914 

Leu Ala Thr Thr Gly Val Pro Cys Ala Ala val Leu Asp val Val Thr 

70 75 80 85 

gaa gcg gqa agg gac tgg ctg eta ttg gqc gaa gtq ccg ggg cag gat 1962 

Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu val Pro Gly Gin Asp 

90 95 100 

etc ctg tea tct cac ctt get cct gcc gag aaa gta tec ate atg get 2010 

Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys Val Ser lie Met Ala 

105 110 ' 115 

gat gca atg egg egg ctg cat acg ctt gat ccg get acc tgc cca ttc 2058 

Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro Ala Thr Cys Pro Phe 

120 ~ 125 130 

gac cac caa gcg aaa cat cgc ate gag cga gca cgt act egg atg gaa 2106 

Asp His Gin Ala Lys His Arg lie Glu Arg Ala Arg Thr Arg Met Glu 

135 140 145 

gcc gqt ctt gtc gat cag gat gat ctg gac gaa gag cat cag ggg etc 2154 

Ala Gly Leu val Asp Gin Asp Asp Leu Asp Glu Glu His Gin Gly Leu 

150 155 160 165 

gcg cca gcc gaa ctg ttc gcc agg etc aag gcg cgc atg ccc gac gqc 2202 

Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala Arg Met Pro Asp Gly 

170 175 180 

gag gat etc gtc gtq acc cat gqc gat gcc tgc ttg ccg aat ate atg 2250 

Glu Asp Leu Val val Thr His Gly Asp Ala Cys Leu Pro Asn lie Met 

185 190 195 

gtq gaa aat gqc cgc ttt tct gqa ttc ate gac tgt gqc egg ctg gqt 2298 

val Glu Asn Gly Arg Phe Ser Gly Phe lie Asp Cys Gly Arg Leu Gly 

200 " 205 210 

gtq gcg gac cgc tat cag gac ata gcg ttg get acc cgt gat att get 2346 

val Ala Asp Arg Tyr Gin Asp He Ala Leu Ala Thr Arg Asp lie Ala 

215 220 225 

gaa gag ctt ggc ggc gaa tgg get gac cgc ttc etc gtg ctt tac ggt 2394 
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Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe Leu val Leu Tyr Gly 
230 235 240 245 

ate gec get ccc gat teg cag cgc ate gee ttc tat cgc ctt ctt gac 2442 
lie Ala Ala Pro Asp Ser Gin Arg lie Ala Phe Tyr Arg Leu Leu Asp 
250 255 260 

gag ttc ttc tga gegggactet ggggttcgaa atgaccgacc aagcgacgcc 2494 
Glu Phe Phe 



caacctgcca 


tcacgagatt 


tcgattccac 


cgccgccttc 


tatgaaaggt 


tgggcttegg 


2554 


aatcgttttc 


cgggacgccg 


gctggatgat 


cctccagcgc 


ggggatctca 


tgctggagtt 


2614 


cttcgcccac 


cccgggctcg 


atcccctcgc 


gagttggttc 


agctgctgcc 


tgaggctgga 


2674 


cgacctcgcg 


gagttctacc 


ggcagtgcaa 


atccgtcggc 


atccaggaaa 


ccagcagcgg 


2734 


ctatccgcgc 


atccatgccc 


ccgaactgca 


ggagtgggga 


ggcacgatgg 


ccgctttggt 


2794 


egaggeggat 


cctggaaggg 


ctaatttggt 


cccaaagaag 


acaagagatc 


cttgatctgt 


2854 


ggatctacca 


cacacaaggc 


tacttccctg 


attggcagaa 


ttacacacca 


gggccaggga 


2914 


tcagatatcc 


actgaccttt 


ggatggtgct 


tcaagctagt 


accagttgag 


ccagagaagg 


2974 


tagaagaggc 


caatgaagga 


gagaacaaca 


gcttgttaca 


ccctatgagc 


ctgcatggga 


3034 


tggaggaege 


ggagaaagaa 


gtgttagtgt 


ggaggtttga 


cagcaaacta 


gcatttcatc 


3094 


acatggcccg 


agagctgeat 


ceggagtact 


acaaagactg 


ctgacatcga 


gctttctaca 


3154 


agggactttc 


cgctggggac 


tttccaggga 


ggcgtggcct 


gggegggact 


ggggagtggc 


3214 


gtccctcaga 


tgetgeatat 


aagcagctgc 


tttttgcctg 


tactgggect 


cgagaagctt 


3274 


gttatcacaa 


gtttgtacaa 


aaaagcaggc 


ttcgaaggag 


atagaaccaa 


ttctctaagg 


3334 


aaatacttaa 


cc atg gtg 
Met Val 
265 


age aag gg< 
Ser Lys G\) 


: gag gag ctg ttc acc gqg gtg gtg 
f Glu Glu Leu Phe Thr Gly val val 
270 275 


3385 



ccc ate ctg gtc gag ctg gac gqc gac gta aac gqc cac aag ttc age 3433 

Pro lie Leu val Glu Leu Asp Gly Asp val Asn Gly His Lys Phe Ser 

280 285 290 

gtg tec gqc gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg 3481 

val ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 

295 300 305 

aag ttc ate tgc acc acc gqc aag ctg ccc gtg ccc tgg ccc acc etc 3529 

Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 

310 315 320 325 

gtg acc acc ttc ggc tac gqc ctg cag tgc ttc gec cgc tac ccc gac 3577 

val Thr Thr Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp 
330 335 340 

cac atg aag cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac 3625 

His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 

345 350 355 
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ggc gac ggc ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag 
Gly Asp Gly Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin 
455 460 465 
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gtc cag gag cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc 3673 

val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 

360 365 370 

cgc gec gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag 3721 

Arg Ala Glu val Lys Phe Glu Gly Asp Thr Leu val Asn Arg lie Glu 

375 380 385 

ctg aag ggc ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag 

Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys 

390 395 400 405 

ctg gag tac aac tac aac age cac aac gtc tat ate atg gec gac aag 3817 

Leu Glu Tyr Asn Tyr Asn Ser His Asn val Tyr lie Met Ala Asp Lys 
410 415 420 

cag aag aac ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag 3865 

Gin Lys Asn Gly lie Lys val Asn Phe Lys lie Arg His Asn lie Glu 

425 430 435 

gac ggc age gtg cag etc gee gac cac tac cag cag aac acc ccc ate 3913 

Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie 

440 445 450 



3961 



tec gec ctg age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg 4009 

Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met val Leu 

470 475 480 485 

ctg gag ttc gtg acc gec gec ggg ate act etc ggc atg gac gag ctg 4057 

Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu 

490 495 500 

tac aag taa agcggccgca ctcgagatat ctagacccag ctttcttgta 4106 
Tyr Lys 



caaagtggtg 


ataacatcga 


taaaataaaa 


gattttattt 


agtctccaga 


aaaagggggg 


4166 


aatgaaagac 


cccacctgta 


ggtttggcaa 


gctagcttaa 


gtaacgecat 


tttgeaagge 


4226 


atggaaaaat 


acataactga 


gaatagagaa 


gttcagatca 


aggtcaggaa 


cagatggaac 


4286 


agctgaatat 


gggecaaaca 


ggatatctgt 


ggtaagcagt 


tcctgccccg 


gctcagggcc 


4346 


aagaacagat 


ggaacagctg 


aatatgggcc 


aaacaggata 


tctgtggtaa 


gcagttcctg 


4406 


ccccggctca 


gggccaagaa 


cagatggtcc 


ecagatgegg 


tccagccctc 


agcagtttct 


4466 


agagaaccat 


cagatgtttc 


cagggtgccc 


caaggacctg 


aaatgaccct 


gtgecttatt 


4526 


tgaactaacc 


aatcagttcg 


cttctcgctt 


ctgttcgcgc 


gcttctgctc 


cccgagctca 


4586 


ataaaagagc 


ccacaacccc 


teactegggg 


cgccagtcct 


ccgattgact 


gagtcgcccg 


4646 


ggtacccgtg 


tatccaataa 


accctcttgc 


agttgcatcc 


gacttgtggt 


ctcgctgttc 


4706 


cttgggaggg 


tctcctctga 


gtgattgact 


acccgtcagc 


gggggtcttt 


catttggggg 


4766 


ctcgtccggg 


ategggagae 


ccctgcccag 


ggaccaccga 
Page 


cccaccaccg 
51 


ggaggtaagc 


4826 
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tggctgcctc gcgcgtttcg gtgatgacgg tgaaaacctc tgacacatgc agctcccgga 4886 

gacggtcaca gcttgtctgt aagcggatgc cgggagcaga caagcccgtc agggcgcgtc 4946 

agcgggtgtt ggcgggtgtc ggggcgcagc catgacccag tcacgtagcg atagcggagt 5006 

gtatactggc ttaactatgc ggcatcagag cagattgtac tgagagtgca ccatatgcgg 5066 

tgtgaaatac cgcacagatg cgtaaggaga aaataccgca tcaggcgctc ttccgcttcc 5126 

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 5186 

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 5246 

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 5306 

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 5366 

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 5426 

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 5486 

tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 5546 

tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 5606 

gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 5666 

agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 5726 

tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 5786 

agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 5846 

tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 5906 

acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 5966 

tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 6026 

agtatatatg agtaaacttg gtctgacag tta cca atg ctt aat cag tga gqc 6079 

Leu Pro Met Leu Asn Gin Gly 
505 510 

acc tat etc age gat ctg tct att teg ttc ate cat agt tgc ctg act 6127 
Thr Tyr Leu Ser Asp Leu Ser lie Ser Phe lie His Ser Cys Leu Thr 
515 520 525 

ccc cgt cgt gta gat aac tac gat acg gga ggg ctt acc ate tgg ccc 6175 
Pro Arg Arg val Asp Asn Tyr Asp Thr Gly Gly Leu Thr lie Trp Pro 
530 535 540 

cag tgc tgc aat gat acc gcg aga ccc acg etc acc gqc tec aga ttt 6223 
Gin Cys Cys Asn Asp Thr Ala Arg Pro Thr Leu Thr Gly Ser Arg Phe 
545 550 555 

ate age aat aaa cca gee age egg aag gqc cga gcg cag aag tgg tec 6271 
lie Ser Asn Lys Pro Ala Ser Arg Lys Gly Arg Ala Gin Lys Trp Ser 
560 565 570 

tgc aac ttt ate cgc etc cat cca gtc tat taa ttg ttg ccg gga age 6319 
Cys Asn Phe lie Arg Leu His Pro val Tyr Leu Leu Pro Gly ser 
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575 580 585 

tag agt aag tag ttc gcc agt taa tag ttt gcg caa cgt tgt tgc cat 6367 

Ser Lys Phe Ala Ser Phe Ala Gin Arg Cys Cys His 

590 595 600 

tgc tgc agg cat cgt gqt gtc acg etc gtc gtt tgg tat ggc ttc att 6415 

Cys Cys Arg His Arg Gly Val Thr Leu val val Trp Tyr Gly Phe lie 

605 610 615 

cag etc egg ttc cca acg ate aag gcg agt tac atg ate ccc cat gtt 6463 

Gin Leu Arg Phe Pro Thr lie Lys Ala Ser Tyr Met lie Pro His val 

620 625 630 

gtg caa aaa age gqt tag etc ctt egg tec tec gat cgt tgt cag aag 6511 

vat Gin Lys Ser Gly Leu Leu Arg Ser ser Asp Arg Cys Gin Lys 

635 640 645 

taa gtt ggc cgc agt gtt ate act cat gqt tat ggc age act gca taa 6559 

val Gly Arg Ser val lie Thr His Gly Tyr Gly Ser Thr Ala 

650 655 660 

ttc tct tac tgt cat gcc ate cgt aag atg ctt ttc tgt gac tgg tga 6607 

Phe ser Tyr Cys His Ala lie Arg Lys Met Leu Phe Cys Asp Trp 

665 670 675 

gta etc aac caa gtc att ctg aga ata gtg tat gcg gcg ace gag ttg 6655 

val Leu Asn Gin val lie Leu Arg lie val Tyr Ala Ala Thr Glu Leu 

680 685 690 

etc ttg ccc ggc gtc aac acg gga taa tac cgc gcc aca tag cag aac 6703 

Leu Leu Pro Gly val Asn Thr Gly Tyr Arg Ala Thr Gin Asn 

695 700 705 

ttt aaa agt get cat cat tgg aaa acg ttc ttc ggg gcg aaa act etc 6751 

Phe Lys Ser Ala His His Trp Lys Thr Phe Phe Gly Ala Lys Thr Leu 

710 715 720 

aag gat ctt acc get gtt gag ate cag ttc gat gta acc cac teg tgc 6799 

Lys Asp Leu Thr Ala Val Glu lie Gin Phe Asp val Thr His Ser Cys 

725 730 735 

acc caa ctg ate ttc age ate ttt tac ttt cac cag cgt ttc tgg gtg 6847 

Thr Gin Leu lie Phe Ser lie Phe Tyr Phe His Gin Arg Phe Trp Val 

740 745 750 ~ 755 

age aaa aac agg aag gca aaa tgc cgc aaa aaa ggg aat aag ggc gac 6895 

Ser Lys Asn Arg Lys Ala Lys Cys Arg Lys Lys Gly Asn Lys Gly Asp 

760 ' 765 770 

acg gaa atg ttg aat act cat actcttcctt tttcaatatt attgaagcat 6946 
Thr Glu Met Leu Asn Thr His 
775 

ttatcagggt tattgtctca tgageggata catatttgaa tgtatttaga aaaataaaca 7006 

aataggggtt ccgcgcacat ttccccgaaa agtgccacct gaegtctaag aaaccattat 7066 

tatcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc ttcaa 7121 



<210> 54 
<211> 264 
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<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 54 

Met lie Glu Gin Asp Gly Leu His Ala Gly Ser Pro Ala Ala Trp val 
1 5 10 15 

Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gin Gin Thr lie Gly Cys Ser 
20 25 30 

Asp Ala Ala Val Phe Arg Leu Ser Ala Gin Gly Arg Pro Val Leu Phe 
35 40 45 

val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gin Asp Glu Ala 
50 55 60 

Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly val Pro Cys Ala Ala val 
65 ~ 70 75 80 

Leu Asp val val Thr Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu 
85 90 95 

val Pro Gly Gin Asp Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys 
100 105 110 



val Ser lie Met Ala Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro 

115 120 ^ 125 

Ala Thr Cys Pro Phe Asp His Gin Ala Lys His Arg lie Glu Arg Ala 

130 135 140 

Arg Thr Arg Met Glu Ala Gly Leu val Asp Gin Asp Asp Leu Asp Glu 

145 150 155 160 

Glu His Gin Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala 

165 170 - 175 



Arg Met Pro Asp Gly Glu Asp Leu val val Thr His Gly Asp Ala Cys 
180 185 190 

Leu Pro Asn lie Met Val Glu Asn Gly Arg Phe Ser Gly Phe lie Asp 
195 200 205 

Cys Gly Arg Leu Gly val Ala Asp Arg Tyr Gin Asp lie Ala Leu Ala 
210 215 220 
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Thr Arg Asp lie Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe 

225 230 235 240 

Leu val Leu Tyr Gly lie Ala Ala Pro Asp Ser Gin Arg lie Ala Phe 
245 250 255 

Tyr Arg Leu Leu Asp Glu Phe Phe 
260 



<210> 55 

<211> 239 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 55 

Met val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val val Pro lie Leu 
15 10 15 

Val Glu Leu Asp Gly Asp val Asn Gly His Lys Phe Ser val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 ' 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 ' 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

val Lys Phe Glu Gly Asp Thr Leu val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 
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Gly lie Lys val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 



val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

<210> 56 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 56 

Leu Pro Met Leu Asn Gin 
1 5 



<210> 57 

<211> 75 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 57 

Gly Thr Tyr Leu Ser Asp Leu Ser lie Ser Phe lie His Ser Cys Leu 
15 10 15 

Thr Pro Arg Arg val Asp Asn Tyr Asp Thr Gly Gly Leu Thr lie Trp 
20 25 30 

Pro Gin cys Cys Asn Asp Thr Ala Arg Pro Thr Leu Thr Gly Ser Arg 
35 40 45 

Phe lie Ser Asn Lys Pro Ala Ser Arg Lys Gly Arg Ala Gin Lys Trp 
50 55 ~ 60 

Ser Cys Asn Phe lie Arg Leu His Pro val Tyr 
65 70 75 
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<210> 58 
<211> 5 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 58 

Leu Leu Pro Gly Ser 
1 5 



<210> 59 

<211> 44 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 59 

Phe Ala Gin Arg Cys Cys His Cys Cys Arg His Arg Gly val Thr Leu 
1 5 10 15 



val val Trp Tyr Gly Phe lie Gin Leu Arg Phe Pro Thr lie Lys Ala 
20 25 30 



Ser Tyr Met lie Pro His val val Gin Lys Ser Gly 
35 40 



<210> 60 

<211> 10 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 60 

Leu Leu Arg Ser Ser Asp Arg Cys Gin Lys 
1 5 10 



<210> 61 

<211> 14 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 61 

val Gly Arg Ser val lie Thr His Gly Tyr Gly Ser Thr Ala 
1 5 10 
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<210> 62 

<211> 15 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 62 

Phe Ser Tyr Cys His Ala lie Arg Lys Met Leu Phe Cys Asp Trp 
15 10 15 



<210> 63 

<211> 24 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 63 

val Leu Asn Gin val lie Leu Arg lie val Tyr Ala Ala Thr Glu Leu 
1 5 "10 15 



Leu Leu Pro Gly Val Asn Thr Gly 
20 



<210> 64 

<211> 4 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 64 

Tyr Arg Ala Thr 
1 



<210> 65 

<211> 73 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 65 

Gin Asn Phe Lys Ser Ala His His Trp Lys Thr Phe Phe Gly Ala Lys 
15 10 15 



Thr Leu Lys Asp Leu Thr Ala val Glu lie Gin Phe Asp val Thr His 

Page 58 



Warf Sequence.ST25.txt 
20 25 30 

Ser Cys Thr Gin Leu lie Phe ser lie Phe Tyr Phe His Gin Arg Phe 
35 40 45 

Trp val Ser Lys Asn Arg Lys Ala Lys Cys Arg Lys Lys Gly Asn Lys 
50 55 60 

Gly Asp Thr Glu Met Leu Asn Thr His 
65 70 
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